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Abstract 

The completion time for the dissemination (or alternatively, aggregation) of information from all nodes in a network plays a 
critical role in the design and analysis of communication systems, especially in real time applications for which delay is critical. 
In this work, we analyse the completion time of data dissemination in a shared loss (i.e., unreliable links) multicast tree, at the 
limit of large number of nodes. Specifically, analytic expressions for upper and lower bounds on the expected completion time 
are provided, and, in particular, it is shown that both these bounds scale as a log n. For example, on a full binary tree with n 
end users, and packet loss probability of 0.1, we bound the expected completion time for disseminating one packet from below 
by 1.41 log 2 n + o (log n) and from above by 1.78 logj n + o (log n). 

Clearly, the completion time is determined by the last end user who receives the message, that is, a maximum over all arrival 
times. Hence, Extreme Value Theory (EVT) is an appropriate tool to explore this problem. However, since arrival times are 
correlated, non-stationary, and furthermore, time slots are discrete, a thorough study of EVT for Non-Stationary Integer Valued 
(NSIV) sequences is required. To the best of our knowledge, such processes were not studied before in the framework of EVT. 
Consequently, we derive the asymptotic distribution of the maxima of NSIV sequences satisfying certain conditions, and give 
EVT results which are applicable also beyond the scope of this work. These result are then used to derive tight bounds on the 
completion time. Einally, the results are validated by extensive simulations and numerical analysis. 


I. Introduction 

Reliable data dissemination (one to many) and/or data collection (many to one), is highly important for a large variety 
of purposes. For example, Over-the-Air Programming (OAP) is an essential utility which enables data distribution such as 
application downloads, software updates, configuration settings, etc., by one central entity to many devices (e.g.. Deluge [1] , 
MNP [2] ,and MOAP [3]). OAP is becoming increasingly important with the expansion of Wireless Sensor Networks (WSN) 
and the Internet of Things, where the networks consist of hundreds or thousands of nodes. In addition, data collection, and 
specifically aggregating data packets from thousands of sensor nodes to a sink in large WSN’s, has numerous applications, 
e.g., habitat monitoring [4], microclimatic monitoring of a coastal redwood [5] and even event detection of an active volcano 
[ 6 ]. 

Setting up a direct link between the source to all its intended receivers for data dissemination or from each source to the 
sink for data collection is usually infeasible, impractical or highly resource-wasteful. For example, in WSN, setting up direct 
connectivity between the sink and all the sensors is usually impossible since sensors are distributed over a vast area while the 
sink transmission power is limited (i.e., the sensors have unlimited transmission power). Even when possible, such a set up may 
result in enormous energy waste due to the proportionality of the required radio transmission power to the squared distance, 
or in poor utilization, due to poor spatial reuse. Accordingly, typical networks will rely on multi hop routing. Furthermore, 
since setting up an autonomous multi-hop path (unicast) between the sink to all devices is also impractical and results in poor 
resource utilization, especially on networks where the redundancy and contention can dramatically impair performance [7], 
multicast tree-based data dissemination or collection is the common adopted solution both for data dissemination and data 
collection, respectively [8], [9]. 

Throughout this study, we focus on one to many reliable data dissemination (downstream traffic), in which a source has 
information it needs to distribute to all other users in the network. Nonetheless, all of our results also apply to data collection 
(ConvergeCast), namely, aggregating data from all nodes to the root of the tree, for a setup in which each intermediate node 
waits until all data coming from its children has been received successfully, before combining it into a single packet, and 
forwarding it to its parent. 

Multi-cast transmission is usually done in two phases: The setup stage, which establishes and maintains the multicast tree, 
determines the routing tree to all nodes, usually under some constraints [10], and the continuous stage, which broadcasts the 
information to all nodes in the tree. In this study, we focus on analyzing the second stage, and in particular, we investigate 
the completion time, which we define as the amount of time required between the start of a block transmission initiated at 
the source (or the end users), and its completion, i.e., successful decoding at all the receivers (or sink) for data dissemination 
(data collection). 

There are several studies in the literature which analyze tree-based reliable multicast. For example, [11] found a recursive 
formula for the distribution of the required number of single packet transmissions, until all nodes receive it. Specifically, 
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[11] assumed that the source which is the multicast tree root, retransmits the packet until it is received correctly by all 
receivers. Note that under this scheme, denoted as End-to-End ARQ mechanism, intermediate nodes only forward each 
packet they receive from their parents to their children, and do not initiate a retransmission of the packet. Extension to bulk 
data distribution utilizing Eorward Error Correction (EEC) codes, and specifically to C{n; k) erasure codes, in which receiving 
any k out of n packets is sufficient to decode the entire group is given in [12]. The benefit of granting each intermediate node 
the responsibility for reliable transmission of packets to its children, compared to end-to-end error control techniques was 
studied in [13]. Specifically, [13] extend the model suggested in [12] to capture also Link-by-Link ARQ and Link-by-Link 
EEC mechanisms, for which every node of the multicast tree is responsible for reliable delivery of packets it has received 
from its parent to its children, utilizing repeated retransmissions or rateless coding, respectively. Note the inherent difference 
between the end-to-end and the link-by-link error controls; for the former, finding the number of transmissions from the 
source is equivalent to finding the completion (delay) time, whereas for the latter this is clearly not the case. Delay is an 
important performance measure in many applications, for example in multimedia application such as distributed games, and/or 
Distributed Interactive Simulation (DIS), emergency response, vehicular communication and even financial information. 

In this study, we consider the problem of Data Dissemination in shared loss multicast tree with link by link error control, 
where the information is propagated from the source (root) to all nodes in the tree according to the following procedure: 
Each node, starting from the root, retransmits the packet it has received from its predecessor until it is correctly received by 
all of its successors. The process terminates when all the nodes in the tree receive the message. We assume that the channel 
between each parent and its children has packet loss probability p, independent of each other. Accordingly, the distribution 
of the number of transmissions, or time slots, needed for each child to receive the packet successfully follows a geometric 
distribution with parameter p. 

Clearly, the completion time which is defined as the number of time slots it takes until all nodes in the tree receive 
the message/s is determined by the time the message arrives at end users of the tree, and, in particular, by the leaf that 
takes the longest time to receive the messages. Since the completion time is a random variable, we explore the distribution, 
and in particular, the expected number of time slots that it takes until the last leaf of the tree has received the messages. 
Mathematically, denote by Ti the arrival time, that is, the time it takes leaf i to receive the message; we are interested in 
finding the distribution of the maximum of these random variables, denoted by M„, or formally M„ = max(Ti,... ,T„). 
We focus on large networks, comprising a large number of nodes (large n). Note that Mn can be defined recursively, 
M„ = maxi<i<if {Wi + where K indicate the number of children each node, being the completion time of 

the subtree which belongs to the child i, and Wi is the time to forward the message to the child i. Even though exploring 
Mn for low values may be possible, for large values of n, this becomes impractical. As a result, our analysis will focus on 
the scaling laws and asymptotic behaviour of Mn- We will, however show via simulations that our analytical results hold 
even for relatively small n, in the order of present-day networks. Another aspect that asymptotic analysis allows is examining 
the impact of increasing/decreasing system parameters, such as the number of nodes or the packet loss probability p on the 
completion time. Such analysis can be useful, for instance in determining the amount of sensors in a single tree or the power 
to invest in a transmission (which affects p), under the constraint of the completion time. Similarly, the structure of the tree 
(e.g., the number of offsprings) can also affect the completion time. 

The main tool that we will utilize is Extreme Value Theory (EVT), which is a branch of statistics that deals with the 
asymptotic distribution of the maxima of random variables. Even though EVT has been widely explored in the literature (e.g., 
[14]-[17]), two main challenges need to be addressed in the context of this work: Eirst, classic EVT considers continuous 
random variables, while we are dealing with number of time slots, hence discrete random variables. The second issue which 
is not less challenging, is that the random variables we are dealing with are correlated, i.e., are not i.i.d., and in particular, are 
not independent, which is due to the structure of the tree. Namely, for each two random variables, Ti and Tj, there might be 
some mutual channels where the message must go through, hence, times Ti and Tj might be correlated. Although there are 
several studies on this topic, the vast majority of them copes with stationary processes (for example [18], [19] for continuous 
and [20], [21] for integer valued). In our case, one can see that the process (Ti,..., T„) is non-stationary. Regarding the first 
issue, we adopt the technique initially suggested by Anderson [22], to bound the CDE of the discrete variables by that of 
two continuous random variables, and apply the rules of EVT on both. Eor the second issue, inspired by previous work, we 
formulate sufficient conditions in order to asymptotically bound the distribution of the maximum of Non-Stationary Integer 
Valued (NSIV) sequences. 

Note that the techniques used in this paper to analyse the dissemination time over trees can be used to extend known 
results for simpler topologies, e.g., [23]-[25], which considered the problem of disseminating k messages to n users, in an 
unreliable single hop broadcast network. Specifically, we find the asymptotic distribution of the completion time for the cases 
of K = Mlog 2 n, for any constant M as well as for k = o(logn) under certain conditions, extending the known results 
which were valid only for constant k and k = oj (n). 

Main contribution 

Our contributions in this work are thus as follows. 
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• We analyse the extremal properties of NSIV sequences, and in particular we find that under appropriate conditions, the 
limit distribution of the maximum of the sequence {Xi,... ,X„) has the same type as if the sequence were i.i.d. We 
believe that such results on NSIV sequences did not exist in the EVT literature. 

• The results on the extremal properties of NSIV sequences are applied to obtain a lower bound on the expected completion 
time, when the source disseminates one message, that is, until all the nodes in the tree receive the message. The lower 
bound is obtained by aggregating the channels of the upper part of the tree into one mutual channel for all users, which 
causes a decrease of diversity. We also derive a matching upper bound. The upper bound is obtained by increasing the 
diversity: assuming independent channels for all users. The bounds are tight in the sense that both upper and lower 
bounds predict the same scaling law, with a difference only in the constant multiplicative factor of the leading term. 

• In practice, it is important to know the number of time slots required in order receive the message at all nodes with 
a certain probability 1 — e. Hence, we also bound the CDF of the completing time, and as a result give an explicit 
expression for this number. 

• We generalize our result to M messages and K child nodes, where K might be a function of n. Furthermore, in some 
cases, we find tight bounds with the same constant multiplicative leading factor. 

• Finally, we derive a closed-form expression for the distribution of the completion time when broadcasting a file consisting 

of K packets to n users, in an unreliable single hop broadcast network, for k = Mlog 2 n and also for k = o(logn), k 
satisfying 0. This extends the known results in the area, valid only for larger values of k. 

The rest of the work is organized as follows. In Section II we give a thorough description of related works. Section III 
describes the system model and the problem formation. In Section IV, we review known EVT results which serve as a basis 
for this work. In Section V, we study the extreme value of NSIV sequences. In Section VI, we obtain the main results, for 
one message and full binary tree, where in Section VII we extend it to M messages and in Section VIII we study the case 
of K-ary trees. In Section IX, we present some numerical result and discuss them. Finally, Section X concludes the work. 

II. Related Work 

The analysis of the completion time for broadcast and multicast network models is well studied in the literature, under 
a wide range of assumptions and constraints. For example, Farley [26] considered the problem of constructing a minimal 
broadcast network, that is, a network where a message can be broadcasted with in a minimum completion time, regardless 
the originator, under the constraint that a member can not disseminate a message to more than one node at the same time 
slot. The author presented an algorithm to construct such networks with n members. In [27], the author extended it to m 
messages, and determined upper and lower bounds on the completion time in broadcasting m messages throughout a network 
with n nodes, assuming that a node can either transmit or receive, and an informed node can send to only one adjacent 
node at a single time slot. Slater et al. [28] presented an algorithm that determines the minimum broadcast time to deliver 
information from an arbitrary node to every other vertex in the tree. Koh [29] extended it to non-uniform edge transmission 
times. Proskurowski [30] characterized class of trees, called Minimum Broadcast Trees (MBT), and form an algorithm to 
determine whether a given tree is MBT. 

In all articles mentioned above, the authors assumed lossless channels in each point to point link. However, this assumption 
may not be realistic, due to the nature of wired and wireless channels. When losses are present, in order to guarantee reliability 
in the network, mechanisms such as Automatic Repeat reQuest (ARQ) or FEC are required. Starobinski et al. [31] analyzed 
the asymptotic performance of data dissemination, focusing on single-hop clusters and multihop cluster chains, assuming that 
the source is instantly notified when all nodes have received a given packet. Eryilmaz et al. [32] studied the delay performance 
under Possion arrivals, and disseminating (broadcasting) in a single hop network to multiple receivers, under two transmission 
modes: round robin scheduling and network coding. In particular, the authors found close forms and asymptotic expressions 
for the time required to transmit all packets at the head of line file to all the receivers, in a single hop, under these two 
transmission strategies. The authors also provided methods for extending the single-hop setting to multiple-hop networks 
with general topologies. In [33], the authors extended this model by considering delay constraints on the incoming traffic. 
Starobinski et al. [23] also considered broadcasting a file consisting k packets to n users in a single-hop wireless network (see 
also [34]), and compared the delay performance (completion time), between network coding and cooperative transmission 
schemes. In particular, the authors obtained the scaling law of the expected delay under randomized linear coding strategy, for 
large n and for the cases of fix k, «: = logn, and where k grows at least as fast as (logn)’’, r > 1. Since ARQ with a single 
source becomes impractical for large amount of users, the authors [24] exploited the advances of practical rateless codes (e.g., 
FT codes [35] and Raptor codes [36]), in order to implement a broadcast scheme which does not require an ACK after each 
reception, and a node can collect packets from a practically infinite stream. The paper analysed the completion time when 
disseminating a file to all receivers located at one hop from the source, by using EVT, including cases with heterogeneous 
packet loss probability. Similarly, Eryilmaz et al. [25] also considered the model of single hop networks, where the source 
is broadcasting a file consisting of k packets to n users, by performing Random Finear Network Coding (RFNC), for a 
time-correlated erasure channel model. The authors analysed the asymptotic throughput and delay, when k increases with 
n, and, in particular, characterized the distribution of the completion time in the cases where k grows faster than logn, i.e. 
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K = Lu(n). Note that besides the cases of constant k and k = oj(n), the asymptomatic distribution of the completion time 
is still missing. In our work, we extend this to the cases where k is a slowly increasing function of n, and for k growing 
logarithmic with n. Moreover, in our main result, we extend the scheme of the single hop to a scheme where the users are 
connected via K-ary tree. Nonnenmacher et al. [37] and Bhagwatt et al. [11] analysed the performance of reliable multicast 
trees, assumed end-to-end error control techniques, and in particular, found a recursive formula for the distribution of the 
number of transmissions until all nodes receive the packet. Note that in our work we are focusing on the completion time, 
rather than number of transmissions. Moreover, Herein, we derived bounds for the close form of the expected completion time, 
which enable us to examine the impart of parameters such as packet loss probability and multicast tree topology. Mosko et 
al. [12] considered FEC for a group of k messages, rather than analyzing the number of transmissions of a particular packet, 
using C {n,k) erasure codes. Ghaderi et al. [13] considered four types of of error control techniques, (end-to-end ARQ, 
end-to-end FEC, link-by-link ARQ and network coding), for a reliable multicast tree, and evaluated the expected number of 
transmissions under these four different error control schemes. In [38], the authors studied the asymptotic analysis of these 
four types, and also analyzed the performance of broadcasting to a set of K receivers over lossy wireless channels with 
different reliability mechanisms (ARQ, FEC, and Network Coding). 

Extreme Value Theory 

As EVT stands at the basis of asymptotic analysis of completion times, in this section, we give an overview of EVT results, 
starting for the classic results on i.i.d. processes, through various enhancements, to more complex stochastic processes. 

As noted below, EVT is a branch of statistic dealing with asymptotic distributions of the maximum of random variables, 
namely M„ = max(Ari,... ,Xn). For the i.i.d. case, Gnedenko’s theorem [39] (see also [40]) states that, if after a proper 
linear normalization, the distribution of converges to some non-degenerate limit distribution G{x), then G{x) belongs to 
either the Gumbel, the Frchet or the Weibull family. Loynes [41] studied the case where sequences are not i.i.d. but have a 
strong mixing property. Leadbetter [18] formulated a long range dependence condition D (un) of the mixing type, yet much 
weaker than strong mixing, and a local dependence condition D (un). In particular, Leadbetter showed that under these 
conditions, the limit distribution of the maximum is the same as if the stationary sequence were i.i.d. The author extended 
the results in [19] to the case where the local dependence condition D (m„) is omitted, and showed that D (un) is sufficient 
to guarantee the asymptotic distribution, where M„ has the same type as if the sequence were i.i.d., with a modification 
reflected in the normalizing constants. Leadbetter described a parameter called the extremal index, 0 < 0 < 1, which connects 
the extreme value distribution to the amount of dependence in the process. For a class of stationary sequences which do not 
satisfy condition D (m„), the author introduced in [42] a mild local mixing condition called D {un), and shown that the joint 
distribution of {Xi,X 2 ), on a stationary sequences, determines whether the extremal index exists, and computed its value. 
Chemick et al. [43] generalized the previous result by defining another local mixing condition called D^{un), and showed 
that under this condition, the extremal index can be calculated (and hence the distribution of M„) with the knowledge of 
the joint distribution of k consecutive variables. Smith [44] and Weissman et al. [45] studied the extremal index, for some 
stationary sequences. For the cases where it is hard to find a closed form for 0, an estimation can be found in [46] and [47]. 

For stationary normal sequences, Berman [48] stated that if the covariance sequence satisfies log n —> 0, then 
converges in distribution to the type I of extreme value distributions (Gumbel), which is the one which applies to i.i.d. normal 
sequences. Moreover, it was shown that Berman’s condition implies that conditions D (un) and D (m„) are satisfied, for 
a well know normalizing sequence Un. Mittal and Ylvisaker [49] extended it (see also [14, Chapter 6]), and discussed the 
maxima when pn —0, but /?„ log n is not tending to zero. 

EVT has also been extended to non-stationary sequences by Busier [50], [51], who formulated equivalent conditions to 
those of Leadbetter’s, including for non-identical distribution, and also for different boundary level for each i, i.e., = Uny. 

Regarding integer-valued sequences (discrete random variables), the results of the classic EVT do not hold because most of 
the discrete distributions (including geometric. Negative Binomial (NB), binomial and Poisson), do not belong to the domain 
of attraction of any of the three types of extreme value distributions. 

Anderson [22] was the first to introduce the study of extremal behaviour of some i.i.d. integer-valued sequences, by defining 
a class of distributions such that the limiting behaviour is bounded, under suitable conditions, by a non degenerate limit 
distributions. In his work, the author also analysed the behaviour of the maximum queue length for M/M/1 queues. Based 
on Andeson’s work, several stationary integer-valued models have been studied. McCormick et al. [20] analysed the limit 
distribution of a stationary sequence satisfying Leadbetter’s conditions D{un) and D {un), and applied it to first order Auto 
Regressive (AR(1)) NB and geometric processes, using an operation called binomial thinning [52]. Hall [21] extended his 
result to a family of stationary processes, satisfying conditions D{un) and D^{un), and applied it to Max AR(1) processes. 
Serfozo [53] considered the case of queue lengths in M/G/1 and GI/M/1 systems, and in [54] considered birth and death 
processes. Hall [55] extended the result of Rootzen [56] to Integer-valued Moving Average (INMA), where the marginal 
distribution tail of the sequence is regularly varying. This class of moving average models is defined by analogy with the 
continuous moving average processes, replacing multiplication by binomial thinning. Hall et al. [57] studied the extreme of 
sub-sampled IVMA models with heavy-tailed innovations, and in [58], the authors extended it to a periodic sub-sampled. 
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i.e., Yk = XgQ .)7 where g{k) is an increasing sequence with a periodic pattern. In [59], the authors considered random 
generated sub-sampled processes, and in [60], Hall et al. studied the limiting distribution of the maximum term of INMA 
when the innovations have exponential type tails. In [61], Hall et al. considered several integer-valued stationary models of 
MA and max-AR type, where the marginal distribution belongs to the domain of attraction of a max-semistable law, which 
means that instead of n observations, we look at a subsequence of observations, such that F*" [a„x -f &„] converges to 
a non-degenerate limiting distribution. Therefore, the authors obtained a well-defined limiting distribution for the maximum 
term, instead of limiting bounds. For the same class of max-semistable law, [62] considered the effect of missing values 
on the extreme value distribution. Regarding non-stationary sequences. Hall [63] analysed the extremal properties of T 
periodic integer-valued sequences, where the term periodic means that for each choice of integers ii <12 < ... < in, 
Pr (ATij ,..., XiJ) = Pr {Xi^+T, ■ ■ ■, and extended it [64] to the class of max-semistable law. Another family of 

processes that was studies is called triangular array. In this family, one usually assumes that one or more of the parameters 
are a function of n. Anderson et al. [65] studied the limit behaviour for i.i.d. Poisson distribution under sufficient conditions, 
where the mean A = A„. Nadarajah [66] extended it to other families of distributions, including NB, where p = pn. 

To the best of our knowledge, the study of NSIV sequences is still missing. In this work, to better analyse the completion 
time, we derive non-trivial extensions to some of the results above, and give an EVT theorem for such processes. It is 
important to note that all the articles mentioned above assume some dependence restrictions, (such as Leadbetter’s condition 
D (un)), and also local dependence conditions (like D' (un) and D (un)), in order to establish their results. We adopt similar 
techniques to obtain asymptotic results of the maximum of NSIV sequences. 

III. Problem Formulation 

In this work, we study the limiting behaviour of the completion time when disseminating a file consisting of M packets 
to all receivers, where all the receivers are connected via a multicast tree. 

In particular, assume that the source wishes to disseminate a file consisting of M packets over a multicast network via a 
full K-ary tree T„ = {V,E) with n leaves (|P| = ,\E\ = where each e € E models a point to point channel 

between a parent and its child, experiencing a packet loss probability p, independent between the channels. 

We also assume an ACK/NACK mechanism, such that it is sent instantly after each time slot, on a lossless channel (i.e., the 
packet loss probability for it is zero), with negligible transmission time. The time axis is slotted, and each packet transmission 
takes one time slot. Each node (which is not the source or the end user) waits to receive M different packets, and only 
afterwards switches to a transmitter mode (assuming half duplex), to send them to all its children. The node continuous to 
transmit until all child nodes get the entire file (i.e., M packets), and the process is terminated when all end users have one 
copy of the file. In order to streamline the process, we assume that the nodes use a rateless code, similar to [24], i.e., each 
node needs to correctly receive M distinct packets from the infinite stream sent by the parent node in order to recover a 
file. Once a certain node receives M different packages, it first decodes it to the original file, and then re-encodes it into 
a potentially infinite stream, and sends it continuously to all its children, until each child node accumulates M different 
packages. The process terminates when all nodes and specifically all leaves can successful decode the file. Note that when 
the file includes only one packet, there is no need to use rateless codes. 

Let Ti be the number of time slots needed for leaf i in order to receive M packets. Since each packet has to traverse n 
links before reaching each leaf node, then for M packets, one need Mlog^n successes, one after the other. Accordingly, 
the distribution of Ti follows NB with Mlog^^ n successes with a probability of failure p, i.e. Ti ~ NB{\og 2 ^jp). 

We denote by the completion time to successfully disseminate the packet to all nodes, i.e., Mn = max(Ti,... ,T„). 

Our goal is to study E [Mn], namely the expected number of time slots that are needed until the packet is sent to all nodes 
on the tree. We focus on the case where n —> 00 , which practically means that the results hold for sufficiently large n. As 
mentioned above, we start with the case of M = 1, and K = 2, i.e., full binary tree, and in Sections VII and VIII we will 
extend it to general K and M, including large K and/or M. 

IV. Preliminaries 

In this section, we review the relevant studies in EVT, including relevant methods and conditions. We also characterise 
properties of the NB distribution, which, as noted earlier, is the marginal distribution of the sequence (Ti,... ,T„). 

Throughout the work, M„ will refer to the maximum of n random variables, where the actual random variables will be 
clear from the context. 

A. Extreme Value Theory 

As mentioned above, the main tool to analyse the problem at hand is EVT, which concerns the asymptotic distribution of 
the maxima of random variables. EVT is well studied in the literature for a wide variety of stochastic processes. Here, we 
briefly review the results of the classic EVT, namely, specify the possible forms of the limit distribution of the maxima in a 
sequence of continuous valued i.i.d. random variables. Note that as the problem in this work involves NSIV sequences, the 
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results below will not be applicable. Hence, a large part of this work (Section V) is devoted to deriving the needed results. 
Yet, we include the results below since they serve as a basis and reference for the rest if the work. 

Theorem 1. [14, Theorem 1.5.1 ] Let {Xi,..., X„) be an i.i.d. sequence, and M„ = max {Xi,..., X„). Let 0 < r < oo 
and suppose that {««} is a sequence of real numbers such that 

n{\ — F {un)) ^ T, as n ^ oo. (1) 

Then, 

Pr(M„<u„)^e-". ( 2 ) 

Note that if is of the linear form of = a; + 6 „, namely, is a shift of x by bn, then t = t{x). Furthermore, if 
T has the form t = a“, a G M, then the distribution of converges to a Gumbel distribution, with mean p = — 
where 7 « 0.57721 is the Euler-Mascheroni constant, and log refers to the natural logarithm (base e). Under mild technical 
assumptions [17], the convergence in distribution implies also moment convergence. Hence 

lim E [M„] -bn= pL. (3) 

n—¥oo 

The reasoning behind Theorem 1 is simple: The continuity of F enables us to assume that the limit in eq. (1) exists. Hence, 
the probability of one random variable being less than is approximately 1 — 7 . Using the assumption of i.i.d. random 

variables we get that the probability of n random variables being less than Un is approximately (l — ^)”. Taking the limit 
n —> 00 results in eq. (2). However, as pointed out above, the sequence (Ti,... ,Tn) is neither independent nor continuous 
valued. The assumption of continuous values is essential because the limit in eq. (1) may not converge if F is the distribution 
of a discrete random variable. Therefore, we say that F does not belong to the domain of attraction of G, where G{x) 
is the non degenerate limit CDF. Anderson [22] was the first to introduce the study of extremal behaviour of some i.i.d. 
integer-valued distributions, by defining a class of distributions such that the limit behaviour is bounded, under suitable 
conditions, by non degenerate limit distributions. Specifically, the main idea is to define a continues function Fc{x), such 
that Fc{x — 1) < F\x\ < Fc{x), for all x, where both Fc{x — 1) and Fc{x) satisfy eq. (1) for some This allows us to 
derive bounds on the limit distribution of the maximum of some integer-valued processes. Further details, including how to 
find Fc (x), will be presented in the next section. 

The next challenge to be dealt with is that the sequence {Ti,... ,Tn) is correlated. Several works have considered this 
topic, for stationary processes (discrete and continuous), for continuous non-stationary processes, and also for random fields. 
All were inspired by the works of Feadbetter in [18], [19]. In his papers, Feadbetter gave two main conditions. The first is 
a weak “mixing condition”, restricting long range dependence, called condition D (un). There are several versions of this 
condition, suitable for a variety of stochastic processes. In our case, since we are dealing with a non-stationary sequence, we 
present the version of [50] (see also [51]), who studied the extreme value of a non-stationary continuous valued sequence, 
with the assumption of non-identical marginal distribution F = Fi, and boundary level Un = Un,i. 

Condition {D {un))‘ The condition D (un) will be said to hold for the sequence (Xi,..., Xn) and {u„}, if for any integers 

1 < ii < ■ ■ ■ < ip < ji < ... < jq < n, (4) 

for which ji — ip > I, L = {ii, I = 1, ■ ■ ■ ,p}, J = {ji, I = I, ■■■, q}, M (I) = {Xi < i G 1} and similarly M (J), We 
have 

sup |Pr (M (J n J)) - Pr (M (/)) Pr (M (J))| < a„,/, (5) 

i,.j 

where an,i„ —> 0 as n —)■ 00 for some sequence („ = o (n). 

Note that condition D (un) is significantly weaker than strong mixing, as it considers only a certain sequence Un. Generally 
speaking, it ensures that the random variables whose indices are sufficiently separated, are asymptotically independent. 
Denote 

n 

and split the set {1,...,n} into sets of consecutive integers -fi,/ 2 ,... in the following way: Let Ii = {1,...,ii} be 
such that 

ii 

^Fi{Un,i) < Fn/r, (7) 

2=1 
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and 


* 1+1 

Fi{Un,i) > Fn/r. (8) 

i=l 

Namely, ii is chosen as large as possible such that the sum is still smaller that Fj^jr. In the same way, I 2 = +1,..., ^ 2 }, 

such that 

i2 

F^ < Fnjr, (9) 

*=* 1+1 

and so on. As a result, we get r sets of consecutive integers, such that v < n. Note that if we assume identical marginal 
distribution F, and identical boundary level t 6 „, for all i, we have |/i| = I/ 2 I = ... = |/r|. Then it was proved in [50, Lemma 
2.6] that if D {un) holds and 


limsupF^ < 00 and liminfF* > 0 , 

n 


( 10 ) 


then 


lim sup 

n—>-oo 


Pr (M„<u„)-P[Pr (M (/,)) 
1=1 


= 0 , 


( 11 ) 


for r fixed, and the sets {/;} defined as above. 

The second condition is a local condition, restricting the possible of clustering of high level exceedances, and it has various 
forms. For example, Leadbetter [18] refereed to it as condition D (un)- 

Condition (^D (un)'j '• The condition L> (un) will be said to hold for the stationary sequence {Xi,..., Xn) and the sequence 
{Un} if 


lim sup n Pr (Xi 


> Ur. 


, Xj > u. 


i) —>■ 0 , as k ^ 00 , 


i=2 


( 12 ) 


where [*J denotes the integer part. 

Condition D (un) limits the probability of more than one exceedance of the level among subset of {Xi,... 

and it has a critical role in finding the extreme value distribution of stationary sequences whose local dependence is not “too 

strong”. 

It was shown in [18] that if both conditions D {un) and D (un) hold for the stationary sequence {Xi ,..., Xn), then the 
limit distribution of the normalized M„ behaves the same as if the sequence above were i.i.d., under the same normalizing 
constants. Yet, our sequence is non-stationary (and also integer-valued). Hence, in Section V, we formulate a suitable condition, 
such that together with D (un), it enables us to asymptotically bound the distribution of the maximum of NSIV sequence. 


B. The Negative Binomial Distribution 

Let Y be a random variable representing the number of Bernoulli trials needed to get one success, for a probability of 
failure p. Therefore, 

Pr(Y = fc) =/-yi-p), A: = 1,2,... (13) 

Y has a geometric distribution with parameter p, i.e. Y ~ Go (p). Now, denote the random variable X = 'Fi, i-C-, X is 
a random variable representing the number of Bernoulli trials until the m’th success occurs, where to is a fixed integer. X 
has a NB Distribution with parameters to and p, i.e., X ~ NB{m,p). 

For a random variable X ~ NB{m,p), denote the Probability Density Function (PDF) /[a;|TO,p] = Pr (2f = x), where ( 
[67], [ 68 ]) 

f[x\m,p]=(^ ^ — p)™ a: = TO,TO+l,... (14) 

\m — 1 / 

and the expected value E [26] is given by 

777 

E[X] = --. (15) 

1 -p 

Let F[a;|TO,p] be the Cumulative Distribution Function (CDF), i.e., Y[a:|TO,p] = Pr (AT < x) : 

F[x\m,p] = Ii-p{m,x — m + 1), (16) 
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where Ix{a, h) is the regularized beta function, that is, 

Ix{a,b) 


(17) 


B{x] a, h) 

B{a, b) 

B{1 — x;b, a) 

and B{x;a,b) is the incomplete beta function, while B{a,b) is the complete beta function, i.e., 

B{x;a,b) = f 
Jo 

B{a,b) = f 
Jo 

B{a,b) has several other forms, including 

M _ na)m 

where r(a;) is the Gamma function, which follows the recurrence relation 

r (x + 1) = a;r (x). 

Furthermore, if x is a positive integer, then r(x) = (x — 1)!. 

Also, B(x;a,b) has other forms, including 


6-1 


B(x;a,b) = x“^(-l)* 


i=0 


b — X 


i j a + i 


and (see [69, pp.994]) 


x^ (1 — x)^ 

B(x; a, b) = - F{a + 6,1; a + 1; x) 


The hypergeometric function F{a, 6; c; z) is defined by the Gauss series 

, . . ab a (a+ 1)6 (6+1) 2 

FMc;z) = l + -z+ + 


(18) 

(19) 

( 20 ) 

( 21 ) 

( 22 ) 

(23) 


V. New Extreme Value Results eor Non-Stationary Integer-Valued Sequences 

In this section, we study the limiting distribution of the maximum term of NSIV, in order to apply it in the next section 
to obtain bounds on Mn- As mentioned earlier, the results of the classic EVT mainly concern continuous random variables. 
Anderson [22] was the first to introduce the study of extremal behaviour of some i.i.d. integer-valued distributions, by 
defining a class of distributions such that the limiting behaviour is bounded, under suitable conditions, by a non-degenerate 
limit distribution. Based on Andeson’s work, several stationary integer-valued models have been proposed [20], [21], [53]- 
[55], [60], however, recall that in our the case, the sequence is not stationary. Hence, in this section, we derive the limit 
distribution of a maxima of a NSIV sequences, under some restricting conditions. 

Eollowing Anderson’s work [22], define a continuous distribution function Fc as follows: Eor F € D (namely, F belongs 
to the domain of attraction of one of the nondegenarate limit distributions), and for each positive integer n, let 

h[n] = — log(l — F[n]). (24) 

Now, define he to be the extension of h by linear interpolation, i.e., for x > 1 

hc{x) = h[[x\] + (x - [xj)( 6 ,[[x + IJ] - 6 ,[[xJ], (25) 

where [xj is the integer part of x. Then, set a continuous CDE, Fe{x) = 1 — e Note that F^ix — 1) < F[x\ < Feix), 

and for any integer n, Fc{n) = F\n]. Now, the choice of 6 „ should be such that 

Feibn) = (26) 

n 

where F denotes 1 — F. As mentioned above, in order to establish the extreme value result, one needs to show that both the 
long range dependence condition (mixing) and the local dependence condition hold. 

Here We will use the technique developed in Section IV, and split {1,... ,n} into set of consecutive integers /i,... ,Ir„ 



where 


^1 — {!) 2 ,..., kn} 

I 2 = {kn + 1) • ■ • , 2fc„} 


(27) 


7 r„ — {(^n l)^n “t“ I5 ■ ■ ■ 5 ^n^n}■ 

Namely, each set includes exactly fc„ indices, thus r„ = Regarding the long range dependence, we will use condition 

D (un) as presented in Equation (3), which implies that we can assume that eq. (11) holds. However, here, the relation 
between n and r has to satisfy certain conditions, hence, we will obtain a somewhat more general version of eq. ( 11 ) result. 
The next lemma proves that if condition D (un) holds, for NSIV sequences, then eq. (11) holds, yet r may also be increasing 
sequence r„. 

Lemma 1. Assume D (un) is satisfied by the NSIV sequence {Xi,..., Xn), with identical marginal distribution, and for 
some identical boundary level Un such that 

limsupnPr( 2 f > u„) = r. (28) 

n—¥oo 

Let Tn be a sequence satisfying 

rnCtn,i„ -> 0 and rj— ^ 0, (29) 

n 

as n —)■ oo, where In, o:n,i„ ore given in condition D (un)- Then, eq. (11) holds for r = Cn- 

Proof: See Appendix A. ■ 

For the local dependence restriction, we now present condition Dj. (un) as follows. 

Condition (un)^'- Let r„ = satisfy eq. (29), and let /i,... ,Ir„ be disjoint sets of {!,... ,n}, as defined in 

eq. (27). Condition D/, (un) will be said to hold for a non-stationary sequence {Xi,... ,Xn) and sequence {u„} if 

max V' Pr {X^ > Un, Xj > Un) < «„ , (30) 

II: ' ^ ’ 

l<l<r„ 

where 


lim sup r„a„ = 0. (31) 

n—^oc) 

Note that Df. (un) is stronger, yet simpler, compared to the condition D presented in [50] and [51], in the sense that 
(u„) herein is harder to achieve, yet easier to check. In [50], [51], the condition D allows to omit a negligible number 
of elements from the set //. More detailed explanations will be given after the proof of the next theorem. 

In the next theorem, which is the main result on this section, we prove the existence of a limit distribution of the NSIV 
sequence {Xi, ..., 2f„), under conditions D {un) and Df. (un). 

Theorem 2. For the NSIV sequence {Xi,...,Xn^ with identical marginal distribution function F[x], let Mn = 
max(2fi,... ,Xn). Then, if conditions D (un) and 79^ (u„) hold for Un = x + bn satisfying 

lim nFc{un) —>■ r (32) 

n—^oo 

lim nFc{un — 1 ) —> r , 

n—^oo 

for 0 < T < T < oo, then 


limsupPr (M„ < Un) < e , (33) 

n—>-oo 

liminfPr(M„ < tt„) > . 

n—¥oo 

Proof: The proof is based on the proof in [14, Theorem 3.4.1 ], yet note that the latter was proved for stationary sequences, 
with a continuous marginal F. Here, we extended it to a discrete marginal F, by bounding it with a continuous marginal F^, 
and, also, generalize it to non-stationary sequences. 

First, for the sets Ii,..., Ir„ defined above, recall that 


M (Ii) = {X, < Un,Vi & Ii}, l = l,...,rn 


(34) 
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Namely, the event M (Ii) implies that the maximum from the set f is less than Un- Now, let (/;) be the complementary 
event, hence 

ih) = U {X, > u„}. (35) 

Thus, by the Bonferroni inequality, we have 

>u„)- Pr(X, M„)<Pr(M^ (/,))< ^Pr(X, >u„). (36) 

j&Il j&Il 

Writing Si = Pr {Xi > Un,Xj > Un), and noting that Pr (Xj > Un) is identical for each j, we have 

l-knPr (X, > Un) < Pr (M (/;)) < 1 - Pr (X, > u„) + Si. (37) 


Fc (Un) < Pr (Xj > Un) < Fc {Un - 1) , 


1 - knFc {Un -1) < Pr {M (/;)) < 1 - knF^ (m„) + S'/. 

Also note that by condition Dj. (un). Si = o for each 1. Thus, we can write eq. (39) as follows 


nFc (m„ - 1 ) 


rr(M(/,)) <!-=%>+Si) ^ 

n/kn \rnj 


Multiply eq. (40) over each Ii. Since we assume that D (un) holds, we can use eq. (11), i.e.. 


hence, we have 


nFc (un - 1 ) 


J]Pr(M(//)) =Pr(M„ <M„) + o(l), 


< Pr (M„ < Un) + 0 ( 1 ) <(l - + o ( — 


and recall that r„ = Now, in order to complete the proof, we use the following claim. 

Claim 1 . Let Um be a sequence such that Um a, as m ^ 00. Then 

lim (l+^^)’" = e“. (43) 

m—>00 V m y 

We give the justification with more details in Appendix B. 

Using the claim with m = once with Um = nFc (un — 1), and once with Um = nFc (m„) + o(l), taking n —)■ 00, we 
have 

e”’’ < liminf Pr (M„ < m„) < limsupPr(M„ < Un) < e”’’. (44) 


Which completes the proof. 


We may deduce at once from Theorem 2 that under conditions D (m„) and Dj. (un), the limiting distribution of M„ is 
the same as that which would apply if {Xi ,..., 2f„) were i.i.d. 

Observe also that Theorem 2 does not apply to all non-stationary sequences, since we assume identical marginal distributions. 
Moreover, as noted before, condition (m„) may be too strict, such that there might be a sequence that fails to satisfy 
this condition, yet the limit distribution of the maximum is precisely the same (including the same normalizing constants), 
as if the sequence were i.i.d. For example: consider the sequence {Xi,... ,X„), where in one of the sets //, there are s/kf 
elements (out of elements) who are identical (i.e., Xi = X 2 = • • • = 2fy^), while the rest are i.i.d. Then, one might 
check that Df. (t6„) fails, yet we have n — s/kf ~ n independent elements, which implies that it is possible to obtain the 
extreme value result as in Theorem 2. Hall [50], [51] addressed this issue, and formulated a mild condition, such that it 
removes ’’unimportant” elements from the set //. 

Another possible extension, similar to the stationary case [42], [43], is to study extreme value of processes who have a non 
negligible local dependence. 

In the next section, we apply our result to obtain bounds on the distribution of M„ in our problem. 
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VI. Asymptotic Analysis of the Completion Time 

As we can see from Theorem 2, D (u„) is essential in order to characterise the asymptotic distribution of M„. In our case, 
the correlation between two sets of random variables on the sequence (Ti,... ,T„) is expressed by their mutual channels. 
For instance, the pair {Ti,Tm) share in common log 2 n — [log 2 m] (log 2 — if m is a power of 2) channels. Therefore, in 
most cases, the number of mutual channels increases with n. As a result, the dependence between two sets is not negligible, 
even for large n, and consequently, D (un) does not hold. Nevertheless, we derive upper and lower bounds for the expected 
completion time, i.e., E [M„], where the lower bound is achieved by modifying the topology of the tree, such that we can 
analyse it using the tools we derived ,and the upper bound is achieved by considering independent channels. 


A. Lower bound 

In order to derive a lower bound, we split T„ vertically into two parts at level log 2 kn, where log 2 S 1,2,..., log 2 n—1, 
such that the first part is a binary dissemination tree, T.^, and the second part includes ^ replications of sub-trees 

denoted by ,..., (see Fig. 1). 

Let Yj ' be a random variable representing the number of time slots it takes to forward one packet from the root of the 
subtree to the leaf j, 1 < j < In addition let = max , Y ^^'^, ■ • ■, . thus, Frepresents the completion 

time of the subtree Clearly, the number of time slots it take to forward one packet from the root of the tree T„ to one 

" (i) 

of the end-user is the time it takes to forward the packet to the root of corresponding subtree , plus the time it takes to 

forward it from the root of to the leaf. Formally, if we denote by Wi, 1 < i < njkn, the number of time slots needed 
to forward the packet to some intermediate node in T„ at level log 2 kn, then the completion time needed to disseminate a 
message to all nodes of T„ can be represent as 

Mn = max -I- 1 < i < n/k^ . (45) 

We construct the lower bound as follows: First, we will calculate the distribution of F = max (F^^^ F^^^ ..., by 

using EVT, which enables us to find the expected value of it. Then, in the next stage, we will add the expected value of W, 
where IF is a random variable, which represents the time slots it takes to forward the message from the root to a node X 
(see Fig. 2), which is located at distance of log 2 n/kn hops from it, thus IF ^ NB (log 2 n/kn,p). In fact, the only change 
that has been made is that we unite all the paths with length of n/kn edges, starting from the root, into a single path, from 
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Fig. 2: Illustration of the construction of the lower bound. Here we can see that a message should pass through log 2 n/fc„ 
nodes, represented by a random variable W until it gets to node X, and afterwards, the message is disseminated to each one 
of the subtrees ..., 


S to X. This leads to decrease in the channels diversity, or in other words, increase the correlation of (Ti,... ,r„), which 
may give an insight about the acceptability of the lower bound. 

Our first result, which is a basic lower bound, follows from the observation that for each realization, we have Ti < 
max {Ti ,..., Tn), for each i. Hence, we obtain a trivial lower bound on E [M„], by just considering the expected value of a 
single NB random variable. Therefore, by eq. (15), we have the following claim. 

Claim 2 . 

(46) 

1 -p 

However, as we will see in the simulation results, this bound can be quite loose, and tighter bounds are necessary. 

Formally, define = W + Y. Next, we show that E [M„] > E [iW^], and afterwards, we obtain an asymptotic expression 
ofE[M']. 

Lemma 2 . 

E [M„] > E [M^] . (47) 
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Fig. 3; Example of the partition of T„ into two identical subtrees, and which are connected via a single channel, 

Wi and W 2 , respectively. 


Proof: Denote i* = argmaXj 1 < i < n/kn), then 


E [M„] = E 


max iWi-\- 


> E 




= E [fF + F] 

= E [Ml] , 


where the inequality results from the fact that max^ {Wi + F^*)) > + F*--^) for each j, including j = i* 


(48) 


An intimidate conclusion from the proof of Lemma 2 is that the bound would be tight if i* = argmax, (Wt + F^®)). 
In other words, the end user that was the maximum from all subtrees T^, , T^, ,..., is also the maximum from 

T„. Consequently, in order to tighten the bound as much as possible, we would like to choose /c„ as large as possible. For 
example, if = ^, we get two identical subtrees ^nd (see Fig. 3), that are connected to the root of T„ by a 

single channel, IFi and W 2 respectively. Here, we can figure out mat we have the tightest possible lower bound. However, as 
mentioned earlier, the distribution of F will be evaluated via EVT, and specifically, by Theorem 2 we derived in the previous 
section. Hence, we need to further find restrictions on fc„, such that it would be possible to apply Theorem 2, in order to 
derive a limit distribution for F. 


We would like to mention that if is a constant, the sequence (Fi,..., F„) meets the definition in Hall [63]. Namely, for 
each subset {Yi ^, ■ ■ •, F;^}, the distribution of ( 1 )^,. ■., F;^) and , F^+t) are identically distributed, for a final 

period T = kn- Accordingly, the extreme value distribution for the sequence is easily found. However, as discussed earlier, 
kn should be as large as possible, and, in particular, such that —)■ c» as n —>■ 00. 

Conditions D [un) and Dj. (un) are essential in order to derive the asymptotic distribution of the normalized F. Regarding 
condition D (un), note that by the construction of the lower bound, f/*®^ and f/*^^ are independent if ii 7 ^ 12 (since they do 
not belong to the same subtree). Therefore, we say that the sequence {F^^* } (we write from now, for simplicity, (Fi,..., F„)) 
is kn dependent, in the sense that we can divide the sequence (Fi,...,Fji) into sub-sequences with size kn, which are 
independent of each other. As a result, similar to Section V, let divide the indices into sets, /i,... ,/r„, where = ^. 
Then, we have 

r-n 

Pr{Y <Un)='^Pr{M{Ii)). (49) 

In particular, one can see that eq. (11) holds. Hence, if r„ is an increasing sequence, condition (un) is sufficient to 
guarantee (for a proper choice of Un) asymptotic distribution of the normalized F. Furthermore, note that by the constmction 
of Tk„, it can be verified that a sufficient condition for (un) is n ("^1 ^ ttn, Xi > Un) —>■ 0 , as n —)■ 00 , which 

is exactly the condition D (u„), presented in eq. (12). Moreover, (Fi,Fi) are independent, for i > (since they do not 
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share mutual channels). Hence, if the assumption in eq. (32) holds, we have: 

kn kn 

n Pr {Xi > Un, Xi > Un) = n O 

i=k„/2 i=kr,l2 



for kn = o [n). Hence, for the sequence (Yi, condition (t6„) may be written as 



k„/2 

Dk^ {un) '■ limsupn Pr (2fi > Un,Xi > m„) = 0. 
1=2 


( 50 ) 


(51) 


Note that since we are dealing with a practical information dissemination problem, namely, n is finite in real life, we should 
consider also the rate of convergence of the result in Theorem 2, and in particular, the choice of Un which affects on the 
convergence in eq. (32). Moreover, the size of (and thus also r„) also has a great impact on it, and this can be seen in the 
transition from eq. (42) to eq. (44), and also by the convergence of eq. (51). Namely, there is a direct relationship between 
the choice of and the rate of convergence, that is, as kn grows, then the convergence rate slows down. As a result, one 
can see that there is a trade off between the tightness of the lower bound and the rate of convergence. This will be illustrated 
via simulation results in section IX. 

Next, we find to satisfy eq. (32) and restrictions on such that (m„) would hold. First, define a continuous CDF 


Fc {x\mn,p) 

for X > rrin — 1, and by the definition of F). we have 


B{x - mn + 1, Wn) 
B{p;x -m„ + 1,TO„) ’ 


(52) 


Fc (x\mn,p) <PriX>x)< Fc {x - , 


(53) 


for some random variable X ^ NB The first lemma finds an asymptotic approximation for Fc (a:|m„,p), that will 

be needed in order to find an appropriate bn satisfying eq. (26), and also to find an asymptotic approximation for the joint 
distribution of (Yi,Yj). 


Lemma 3. Let 


where 


1 — P 

4>{n) = a log 1 n + (m„ - 1) log i ( -T'a (m„) ) - log i {rrin - 1)! + a;, 


{rrin) = a log i n + (to„ - 1 ) log i 


1 -F 
P 


(54) 


(55) 


for 0 < a < 1, (we write for simplicity T*! =4*). Assume the integer sequence = o (logn) is a slowly increasing function 
of n, (and note that T'q {rrin) a function of rrin, ond not for n). In addition, let 


,, nr=v'(</'W-j) 

e (n) = 


Then, we have 




Fc {f (n) \mn,p) = —e(n) 1 + 0 


/ rrin 
\logn 


(56) 


(57) 


Note that 4>{n) depends on x. Moreover, if e(n) —)■ 1, as n —)■ oo, we have 


Fc {f (n) \mn,p) = ^ (1 + o(l)) • 


(58) 


Proof: See Appendix C. 


Remark 1. To understand how to choose rrin such that eq. (56) is satisfied, note that 

njL"o”^ {rtr{rr)- j) ^ ( fi{n) 


(T'a {rrin)) 


^'a {rrin) 


( 59 ) 


\ '^a{mn)J 
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where 


A(n) = (m„ - l)logi (5'a (m„)) - logi (m„ - 1)! + x. 

P P 

Using the fact that 1 + x < e“, we can write 

nr=o”'w«)-j) 


(5'a {mn))'' 


Hence, a sufficient condition for choosing mn is 


< exp 
= exp 


A(n) 

(Wn), 

O {mn log (^g (Wn))) rrir, 

(W„) 


(60) 


(61) 


log log n ^ p 


(62) 


n->oo log n 

Remark 2. By the proof of Lemma 3, it holds for any x satisfying eq. (56), hence it can be concluded that a sufficient 
condition for x is that x = O {mn log log n). Furthermore, if x < 0, but still f (n) = O (logn), then 


4> (n) = 4> (n) + X < f (n). 

Hence, if ([) (n) satisfies eq. (56), we get that for large enough n, 

Fc{(l){n)\mn,p) < 

This will be used in order to show the asymptotic approximation of the joint distribution. 
Proposition 1. Let Un {x) = x + &„ (for simplicity Un), where 

1 -P, 


(63) 


(64) 


bn = logl n + (log 2 kn - 1) log 1 

p p 

We have the following result. 


P 


4' (loga kn) - log 1 (log 2 kn - 1)!. 


lim nFc (m„| log 2 fc„,p) = p®. 

Proof: The proposition follows immediately from Lemma 3, with a = 1, and mn = log 2 kn. 


(65) 


( 66 ) 


An immediate conclusion from Proposition 1 is that if (Yi,... ,y„) would be i.i.d., then eq. (33) would hold for t = p^ 
and r = p^~^. We should pay attention that for constant log 2 this lemma was already found in [24], and also for a 

slightly different in [20]. However, here we extend it to the case where log 2 may also be an increasing sequence of n, 
where log 2 kn satisfies eq. (56). As we saw earlier, a sufficient condition for is 


(loga fcn)^loglogn 

n->oo log n 


(67) 


Consequently, the results of [24] remains true also for the case where the file consisting of k packets, where k is an increasing 
sequence of n satisfying —?• 0, as n —> oo. In the next subsection, we will extend it for the of k = log 2 n, and in 

Section VII, we will generalize it for k = M log 2 n, for some constant M. 

Observe that for log 2 kn = 1, we get that the sequence (Yi,..., Y„) is i.i.d. following geometric random variable. Thus, 
by Theorem 2, it is possible to obtain asymptotic approximation of Y, namely for large enough n, 


e ^ < Pr (Y < 6„ + x) < e ^ , 


( 68 ) 


with bn = logl n. Therefore, the distribution of Y — 6„ is bounded from both sides by the CDF of a Gumbel distribution.' 

P 

Hence, according to eq. (3), 

E[Y]>5„-^, (69) 

logp 

which implies that we can derive an immediate lower bound on E [M^] as follows. 


Claim 3. 


E [Mn] > 


1 

1-p 



_J _ 

logp 1 — p 


( 70 ) 


'Note that this is result is a well known results in the literature. 


15 



Proof: 


E [Ml] =E[Y + W] 

= E [y] + E [W^] 

, 7 logofn) — 1 

> log! n - -H-^- 

P log p r — p 


1 1 

1 -P l 0 g 2 P 


log 2 n - 


7 _ 1 

logp 1-p' 


The lower bound in the claim above has a scaling law of ^ ~ Iog~p) However, as discussed earlier, this bound 

can be improved by setting kn as large as possible, which will cause dependence between the variables in the sequence 
(Yi,... ,Yn). Hence, next, we show that condition Dk„ (wn) holds with kn satisfying eg. (67), hence we can further tighten 
our bound. 

First, in the next lemma, we find an asymptotic bound for the joint distribution. This bound will be used in order to prove 
that condition (u„) holds. 

Lemma 4. For i = 1, 2,..., log 2 kn — 1, we have 


Pr(yi > Un,Y 2 i > Un) 


for some h < a < 1, where 


Proof: See Appendix E. 


< 2^ iCA> (log2 log2 (1 + o (1)) (72) 

n \ 1 — p ) 


C = max 


logi n’ T- (log 2 fc„) ) ■ 


We may now give the key result on {Yi ,..., Y„), which will enable us to reason on its properties for large n. 
Lemma 5. The sequence (Ti,..., Y„) satisfies condition Dj, (un). 


Proof: Let 


fen /2 

= n ^ Pr (Yi > Un, Yj > Un) 


Then, by eq. (51), condition D^, (u„) is satisfied if an 0, as n ^ oo. 

As mentioned before, we may write the joint distribution Pr (Yi, Yi) as Pr (W + Zi,W + Z 2 ), where W is a random variable, 
representing the shared branches of the tree (dependent channels) o/Yi, Yi and Zi, Z 2 are the independent branches (channels) 
of Yi, Yi, respectively. 

Therefore, we partitioning all the pairs (Yi, Yf ), i G {2,..., A:„/2}, according to the number of shared branches. The number 
of pair with log 2 kn — i shared branches and i independent branches is 2 ®“^. 

As a result, we can write an as 

kr,l2 

an =n ^ Pr (Yi > u„, Yi > m„) (75) 

i^2 

log 2 k-n — l 

=n ^ 2®“^ Pr (Yi > Un, Yai > Un) 


Using Lemma 4, we have 


log2fe„-l , / A \ 

2 * • f - (log2 knj 


= {C^ (log 2 kn)) 


log 2 fen -1 


log2 fcn-1 


log2 fe„-l 


Y c—y: 2 - 
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Let 


g{n) = log 2 kn, and note that g(n) = O ^ ^ ■ Hence, by eq. (67), g{n) —)■ 0 ai n —)■ oo. Then 

log2 kn — 1 

Oln (logs ^n)f 


. fc„-l 


E (»(”))' + 5 : *"■ 


2=1 


2 n 2 


= (CVI/ (log 2 (1 - g{n)t^^ 


(logzfcn))' 


-5(«) 

\l 0 g 2 fc,, —1 dijt) 1 kn 

2 n 2 <^-i ^ l-g{n) 2n2“-i 

Atow, as n ^ 00 , clearly i^g^n) addition, note that {C^ (log 2 ^ 

logfcn — (2a — 1) logn —>■ — 00 , 


1. Moreover, note that. 


>0if 

(77) 


and as noted before, by the choice of kn, 0, then together with a> \, we obtain ^ 2 a -1 0. Hence, a„ —)■ 0 and 

the lemma follows. ■ 


We can now give the main result in this section, which bounds the expected completion time in the dissemination tree T„. 

Theorem 3. The expected completion time E [M„] needed to broadcast one packet to all receivers in the dissemination tree 
is bounded by 


E [M„] > 'tSfjfhL _ + i,„ 


(78) 


■p logp 

where bn was defined in eq. (65), and kn satisfies eq. (67). 

Proof: Since Lemma 5 and Proposition 1 hold for the sequence Y = max (Yi,..., Y„), then, by Theorem 2, for sufficiently 
large n. 


s-P"” <Pr(y<u„)<e-P". 


(79) 


Therefore, as indicated previously, Y — bn is bounded by a non-degenerate distribution following the Gumbel law, hence by 
eq. (3) 


and in particular 


Hence, 


6 „ + r^<E[F]<5„-^ + l, 

logp logp 


E [Y] > — -h bn- 


logp 


E[M„] >E[M' 


(80) 


(81) 


(82) 


= E [lY + Y] 

=E [lY] + E [Y] 
Ml 0 g,n/t„+ T 


1 -P 
1 


1 


logp 

log 2 n + o(logn), 


fi-p logaP, 

where (a) is by Lemma 2 and (b) is according to the construction of the lower bound. In (c), we used the expected value of 
NB (eq. (15)) for lY and for Y. ■ 

Theorem 3 presents a lower bound on the expected completion time in the dissemination tree for large enough n. The 
choice of kn has a crucial effect of the convergence of the CDF of Y, and as a result, also on E [Y]. In addition, note that 
asymptotically, fc„ has no effect on the the multiplicative factor of the leading term, logn. However, as noted above, the 
bound is tighter as increases, and in particular, tighter than the bound defined in 3, which refers to kn = 1. 


B. Upper bound 

In this section, we provide an upper bound on E [M„], by considering an independent sequence of NB random variables 
with log 2 n successes. We study the distribution of the maximum of that sequence, and as a result, we also find an upper 
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bound of the tail distribution of M„. Namely, for each e > 0, define the probability 1 — e to be that after T time slots, the 
message reaches to all nodes in T„. Then we find an upper bound on the completion time T as a function of e. 

Let ^Ti,..., r„^ be i.i.d. random variables with a marginal distribution function F [x| log 2 n,p], i.e. Ti ~ NB (log 2 n,p), 

and let Mn = max ..., ■ In the following lemma, we make precise the intuitive result that the maximum over the 

leaves in the tree T„ is upper bounded by the maximum of i.i.d. variables with the same marginal distribution. 


Lemma 6. 


E [Mn] < E 


Mn 


(83) 


Mo. 


. As mentioned above, Mon can 


Proof: We prove the lemma by induction on the height of T„. Let h = log 2 n be the height of T„. For the base case, 
i.e. h=l, clearly Pr (Mn < a;) = Pr and thus E [M„] < E . For the induction step, assume that the claim 

holds for some h, namely E [Mn] < E Mn . We prove it for h+\, i.e. E [Mon] < E 

be represented recursively, namely, the dissemination time of Ton is the maximum Between the dissemination time of two 

independent subtrees and Tn, plus the time it takes to forward the message to the root of each one of these subtrees. 

Formally, let Wi and W 2 be i.i.d. geometric distributions, and Mn\M^'^ be the dissemination time of each one of Tn'^ 
( 2 ) 

and T„ , respectively (see Fig. 4a). Then 

E [-^2n] = E 

< E 


max (Wi + , W 2 + 

max {Wi + M^'>, W 2 + 


(84) 


where in the last step we used the induction step (see Fig. 4b). Thus, in order to complete the proof, it is sufficient to 


show that E 


max (Wi + Mi^\W 2 + M^n'’ 


< E 


M 2 . 


. Now, for this part of the proof, we use the same technique we 


used in Lemma 2, that is, M 2 n = rnax {Wi + T); 1 < z < 2n^, where is an i.i.d. sequence following a geometric 

distribution, and {TilfLi is also i.i.d., but Ti ~ NB {h,p), (see Fig. 4c). Now, denote h = argmax 1 < z < and 
I2 = argmax rz + 1 < z < 2n^. Therefore 


M 2 n = niax {Wi +7); 1 < z < 2n 
( max (Wi + f)) , 

max (iLi + M^^\W2 + , 


(85) 


= max 


> max I 


max 

n<j'<2n 




which completes the proof. 


By Lemma 6, E 




is an upper bound on E [M„]. Therefore, we now derive a closed-form expression for Pr , 


and use this distribution to find E 




J. Note that, as mentioned earlier, the extremes of rz random variables with NB marginal 
distribution have been already studied by [20], [24], for a fix number of successes, while here we consider log 2 n successes. 
At first, similar to the previous section, we define a continuous random variable with CDF Fc{x), such that for any real x, 
Ff.(x — 1) < F[x] < Fc{x), and for an integer x, Fc(x) = F'[x]. The next step is to find a sequence 6„, which satisfies 
Fcibu) = 

Since 


F[x[m,p] = Pr (X > x) = 


E 


Pr {X = k) 


( 86 ) 


But, with the expression above, it is not clear how to define Fc (x). Although we can use eq. (52), here we find Fc in a 
slightly different way. In the next lemma, we present a closed form for F[x[m,p] when x has the form x = am + dm- 

Lemma 7. Let X ^ NB (m,p), and define the sequence bm = am + dm, where a is a constant satisfying a > and dm 
is a negative sequence such that dm = o (m). Then, for sufficiently large m, 

Pr (x > bm) = Cm Pr [x = bm + 1 ) (87) 

where Cm C, as m —t 00 , and C = p - 
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(a) Illustration of the dissemination tree T 2 n, which consists of two identical subtrees 
and with completion times and M^\ respectively. Each one of the subtrees is 

connected to T 2 n via a single channel, where the time it takes to forward a message through 
is denoted by Wi and W 2 , respectively. 



O 


O 


o 


o 


(b) Illustration of 2n independent paths, each has h hops that merge into Wi and W 2 - 


9 9’ 

Wi 

o- 




0 O 


0 0 


o o 


0 c) 


9 9 
ro o 




0 0 


o 


6 


o 

IV2„ 


0 0 


>T2„ 



(c) Illustration of 2n independent paths, each has h + 1 hops, where the time to forwai'd a 
message through the first hop is denote by and the remaining h hops are denoted 

by 

Fig. 4; Phases of the induction in the proof of Lemma 6. 
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Proof: See Appendix F. 


Lemma 7 shows that the tail distribution Pr > hmj, under an appropriate condition on b^a, is the PDF at + 1, 

multiplied by a constant. The PDF is defied by eq. (14). Note that for any integers n and k, (^) = ■ Hence, 

an immediate consequence of Lemma 7 is that it is possible to define a continuous function Fc, where 


F 




= c„ 


r(&m + 1) 






r{m)T{bm — m + 2) 

In the next lemma, we find an asymptotic approximation for Fc ■ 

~ if 

Lemma 8. Let bm = am + dm, where dm satisfies 0 as m — >■ oo. Then, for sufficiently large m, 

/ \ m 

1 1 


( 88 ) 


F 


(fcm) 


= C'„ 


(a - 1) 




ma—IV 27rV \ m 


(89) 


where S = -^^p. 

Proof: See Appendix G. 


As mentioned before, we are eventually interested in studying the extreme value of random variables with marginal 
F[a;| log 2 n,p], hence, later we will assign m = log 2 n, and clearly = feiog^n, namely has the form 5„ = 
a log 2 n + o (log n). In order to find an appropriate constant a and sequence dm such that Fcifin) = -, we will use two steps: 
At first we find a satisfying 


- - -= - 

(a-l) 2 


(90) 


subject to a > and then find dm such that 






y/m {a - 1) 


= 1 . 


(91) 


However, it is not clear whether the solution for eq. (90) exists, and in particular if it satisfies the condition present in 
Lemma 7, i.e. the solution must satisfy that it is larger than In the next proposition, we argue that there exists one and 
only one solution for a in eq. (90), satisfying a > 


Proposition 2. For any a > 1, define 

CX.^ 

5p(a) = 2;-(1 - P) - 1- (92) 

(a-l) 

Then, for each 0 < p < 1, there exists exactly one root for gp{a), ap, which satisfies otp > 

Proof: See Appendix FI. ■ 


In Fig. 5, we plot the roots of pp (a): ap, and a*, together with the maximum value, i.e amax = It can be seen from 
the graph that ap > > a*, for all 0 < p < 1, and in particular, there is exactly one root satisfying ap > Hence, 

we can conclude that there exists a unique solution for eq. (90) subject to a > which implies that bn has the form 
bn = ctp log 2 n + o (logn). The next lemma presents an explicit expression of bn- 


Lemma 9. Let Fc (x) be the CDF of a continuous random variable as defined in eq. (88), for real x > log 2 n—1, and define 

bn = ap log 2 n + log^ Vlog 2 n + (3, (93) 

where ap is the root of gp (a), and 


a 


^ 


ap 1 


I27T ap — 1 


Then 


6p C 
lim nFc {bn + a:) = S^. 


(94) 


(95) 


Proof: By Lemma 8, 

nFc {bn + x) 


(96) 


20 




Fig. 5; Plot of Qp {a), with the roots ap, a* and amax, where amax = as a function of p. 


log 2 n 


=nC„ 


(ap - 1 ) 


•13 /y _ 

zrrP ” 




1(1 _ p) j (5'og6 Vlog 2 n+/3+x 


(ap - l)v1og^ V 27r 


g(i+»(i)) 



as n ^ oo. 

The next theorem gives bounds on the limit distribution of M„. 

Theorem 4. Let + x, for bn defined in eq. (93). Then, 

limsupPr (hln < Wn') < e“'^ 


< Ur^ 


lim inf Pr ( Mn < ) > e 

n—>-oo 


-5^ 


Proof: recall that Fc(x — 1) < F’ [a:| log 2 n,p] < Fc(x), therefore 

Pr (^Mn <Un^ = F [Un\ loga n,p]"‘ 

<FAUnr 



where the last step follows from Lemma 9. In the same way, 

Pr {Mu <u^ = F [un\ log2 n,pY" 

> F, (Un - 1)" 



(97) 


(98) 


(99) 
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Remark 3. Theorem 4 may be also used to extend the model an unreliable single-hop broadcast networks, as mention 
earlier, where k = log 2 n. In particular, the theorem reveals that the number of redundant packets required to guarantee file 
completion by all receivers with probability bounded between e~^ and e~^ is bn + x. In Section VII, we will generalize 
it for K = M log 2 n, for some constant M. 


By Theorem 4, the CDF of — bn is bounded by two CDFs of continuous random variables, each of which follows the 
Gumbel distribution. Denote them as T„ and T;. It can be verified that 


hence 


E[Ti] 

E[T,] 


7 

log 5 


log (5’ 


7 

log (5 


< E 



< bn 


7 

log (5 


+ 1 . 


( 100 ) 


( 101 ) 


Now using Lemma 6, we get the upper bound, i.e. 

E [M„] < ^ + 1. 


( 102 ) 


C. Summary 

In this subsection, we summarize the results of the two previous subsections. Joining the result of the lower bound in 
Section VI-A, and in particular eq. (82), and the result from the upper bound from Section VI-B eq. (102), we have 

E [Mn] > ( -:j-^—) log 2 n + o (log n) (103) 

VI-p log2P/ 

E [Mn] < ap log 2 n + o (log n), 


where ap is the roof of pp (a), given in eq. (92). Since it is clear that the bounds differ only in the multiplicative constant 
factor of the dominant component (logn), define 


then, clearly by eq. (103) 


* 


a 


lim 

n—>-oo 


E[M„] 
logn ’ 


(104) 


((l-p)log2 



< a* < 


CXp 

log 2' 


(105) 


In Fig. 6 we give the constants iog 2 ~ of the lower bound and of the upper bound on a* for 0 < p < 0.5. 
We can see that the bounds are tight for small enough p. 

In addition, by Theorem 4 and Lemma 6, we conclude that for large enough n, 

Pr (M„ < cc) < . (106) 


Therefore, for e > 0, define the probability 1 — e that after T time slots, the message reached all nodes in T„. Then, 
1 - e < e"^ “ ", which implies that 

T <6„+log 5 (-log(l - e)) (107) 

=bn + log^ e + O (e). 


Namely, the time it takes to disseminate a single message to all nodes in T„ (with probability of 1 — e), is no more than 
('n + loga (-log(l-e)). 

Simulation results of the CDF of Mn, and also E [M„] with the proposed bounds, are available in Section IX. In the next 
two sections, we generalize our model to the case where source send M messages, and where T„ is K-ary tree. 


VII. Completion time with M messages 

Thus far, we have discussed the case of disseminating a single message in the network. However, in a practical system, it 
is most likely that one would need to disseminate several messages. Fortunately, the methods above can easily extend to the 
general case. For simplicity, we now assume a full binary tree, but, this result can be readily generalized to K child nodes 
as well. 
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Fig. 6: The upper and lower bounds on a* as given in eq. (105), where a* was dehned in eq. (104). 


Regarding the lower bound, there is no change in the structure as given in Section VTA, i.e., we divide T„ vertically at level 
log 2 kn, where there is only a single path which transmits the message from the source to the roots of , • ■ ■, " , 

(see Fig. 2). The time it takes to forward M messages one after another through log 2 hops is denoted by W. Hence, 

W ~ NB (M log 2 In addition, the asymptotic approximation of the normalised Y (where Y denoted the maximum 

of the completion times in , • ■ ■, " ), remains unchanged, expect that log 2 is multiplied by a constant M. 

Therefore, by Lemma 3, for m„ = M log 2 kn, we get that nFc {bn + x) —>■ for 

bn = login + (Mlog 2 A:„ - l)logi ( (M log 2 A:„)) - log i (M log 2 - 1 )! (108) 

P p \ p / ^ 

In fact, since M is a constant (with respect to n), the result holds for any kn satisfying eq. (67). Moreover, from the same 
reason, it can be verihed that Lemmas 4 and 5 hold for M log 2 instead of log 2 kn, and therefore 

E[Y]>bn- (109) 

\ogp 

and 

E[M„]>E[M^] (110) 

= E [IT + F] 

^ Mlog^n/kn , ^ 7 

l-p ” logp 

Regarding the upper bound, similar to Section VLB, dehne ^Ti,... to be an i.i.d. sequence with NB distribution 

following M log 2 n successes, and dehne = max ..., Tn j. Then, using Lemmas 7 and 8 , we can derive an asymptotic 
approximation for the CDF, with m = Mlog 2 n. As a result. Lemma 9 and Theorem 4 can be applied with the following bn- 

bn = ap^MM \og 2 n +logs ^/M\og/n + j3, (HI) 
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Fig. 7; Upper an lower bounds on a*, given in eqs. (110) and (113), (a* was defined eq. (104)) as a function of M, for 
p = 0.1,0.5. 


P and 5 were defined in eq. (94), and Up^M is the solution for 

-^^p“-i(l-p)=2-i. (112) 

(a - 1) 

Finally, we argue that there is only one solution for ap^M, satisfying ap_M > This can be seen by the proof of 

Proposition 2, which is easily generalized by assigning 2v instead of 2. As a result, we have 


E [M„] < E 




< br,. - 


log 5 


+ 1 


Otp^M^ , n \ 

= , „ logn + o(logn). 

log 2 


Hence, by eqs. (110) and (113), we obtain 


M 


(l-p)log2 logp 


< a* < 


log 2 


(113) 


(114) 


were a* was defined in eq. (104). 

Fig. 7 depicts the leading multiplicative constant, of the upper and lower bounds, as outlined in eq. (114). We can conclude 
that both of these bounds grow with the same scale, as M increase. Indeed, the next lemma limits the solution for to 

the area between and + em, where cm —>■ 0, as M —>■ oo. As a result, both upper and lower bounds have iog 2 
as the leading multiplicative constant. 


Lemma 10. Let, 

gp{a\M) = 2^- -(1 - p) - 1, (115) 

(a - 1) 

And denote the root of gp[a\M) as Up^M- Then, for sufficiently large M, we have 

- - < cXp M ^ - -h £m, ( 116 ) 

1 — p 1 — P 
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where sm > 0 is a slowly decreasing function, such that cm = ^ as M ^ oo. Namely, cm 0 slower than 

Proof: See Appendix 1. I 


An immediate conclusion from Lemma 10 is that for large n and large M, E 




scales as " . As a result, since 

i-p ’ 


both upper and lower bounds have the same scaling law, we obtain the completion time of M„ scales as 

E [M„] = + o (M log n). 


M log2 n 

1 -p 


, i.e., 


(117) 


In addition, recall that the arrival time of some certain end user i follows NB with M log 2 n successes. Hence, E \Ti] = ” , 

which implies that the expected completion time to disseminate M messages to n end-users through T„, scales as the same 
as the time to forward it to a single terminal node with a hop count of log 2 n. 

VIII. Completion time with K child nodes 

In this chapter, we generalize our problem, to the case where each node in the multicast tree has exactly K child nodes. 
We distinguish between two cases: First, when K is a constant, and second, where K is an increasing sequence with n. For 
both cases, we assume that the multicast tree has n end users (leaves). The height of the full K-ary tree with n leaves is 

log n 


hn = logi^ n = 


log AT' 


(118) 


A. Constant K 

We start with the case where K is constant. We will consider here the limit as K grows, however only in the sense that 
n —)■ oo is taken first. Regarding the upper bound, we would like to find an asymptotic approximation for the distribution of 
M„, i.e., the maximum of n i.i.d. random variables, each following a NB distribution with ft,„ successes. Hence, we can use 
Lemmas 7 and 8 in order to derive an asymptotic approximation for the CDF, with m = hn- Consequently, Lemma 9 and 
Theorem 4 can be applied with 


hn — Otp^Khn T lDg,5 s/~hn 

: logn + o (logn), 


ap,K 

log a: 


(119) 


where /3 and 6 were defined in eq. (94), but the generalized solution for ap^K is no longer eq. (90), but 


\a —1 


K 


( 120 ) 


(«-ir 

Finally, we argue that there is only one solution for ap^K, satisfying ap^K > This can be seen by the proof of 

Proposition 2, which can be generalized easily by substituting AT instead of 2. As a result. 


E [Mn] < E 


Mn 


( 121 ) 


< bn - 


log 5 


+1 


= ^^iogn + o{logn). 

Regarding the lower bound, there is no change in the structure as given in in Section VI-A, except that the partition is done 
at level log;^ fc„ C 1, 2,..., log^ n — 1. Hence W ~ NB (iog^ In addition, the asymptotic approximation of the 

normalised Y remains unchanged, expect that log 2 kn is replaced by log^^ kn- Namely, for sufficiently large n. 


e ^ < Pr (F < Un) < e ^ 

for Un = bn + X, where 

= log 1 n + (log^ kn - 1 ) log 1 


1 - A, 


P 


T- (log;^ kn) - log 1 (log^ kn - 1)! 


For further details see Appendix J. 
Therefore, we write the lower bound as 


E [M^n] > 


log*: n/kn 7 


l-p 


logp 


( 122 ) 


(123) 


(124) 


1 


1 


(l-p) log a: logp 


log n + o (log n) 
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Fig. 8; Graph of the the upper and lower bounds on a*, as a function of K, with exponential scale, for p = 0.1,0.5. The 
coefficient of the lower bound is and the upper bound is We can see that both these bounds, are 

converge (relatively slow) to — 


Note that for a fix n, we expect that E [M„] would decrease as K increase, since the number of successes required for each 
end-user (i.e. is decreased as a function of K. As a result, we expect that a* would also decrease as K increase. 

Indeed, for the same reason, this claim holds also for the i.i.d. case. Hence, we also expect that the upper bound on a* (i.e., 
log g ) would decrease. For the lower bound, it is easy to see in eq. (124) that the lower bound on a* indeed decreases as 
a function of K. In Fig. 8 we plot these bounds on a* as a function of K for p = 0.1 and p = 0.5. For the lower bound, 
since the multiplicative constant of the leading term is ~ for AT —> oo the expression 

goes to — . However, we can also see that the upper bound tends to — as well, for large K, which implies that these 

bound are getting tighter as K increases. Note that — logp is a limit inferior for a*, since each end-user has at least one an 
exclusive channels. Thus, as we seen in Section VTA, the completion time of disseminating to n nodes, scales as logi n. 


B. K growing with n. 


Next, we consider the case where K (in fact Kn) is monotonically increasing with n. We distinguish between two cases: 
the first is where the height of the tree (denoted by h) is constant, and in the second it is monotonically increasing with n. 

1) Constant h: As mentioned above, we can bound E [M„] from above by the expected value of the maximum of an i.i.d. 
sequence with the same marginal distribution. Since h is constant, then clearly by Proposition 1, for log 2 = h, 

E [M„] < + ^ + 1 (125) 

logp 

= logi n -I- o(logn), 

P 

where 


bn = logi n+ (h-l) log 1 


1 -F. 


4' (ft.) ) — logi (ft — 1)! 


(126) 


Furthermore, Theorem 2 guarantees that if both local and long range dependence conditions are satisfied, then it is possible 
to obtain tight bounds (which differ only by 1) for E [M„]. 

Note that T„ has Kn child nodes, each having n/Ar„ leaves, which are independent of each other. Hence, T„ can be 
considered as Kn dependence (namely, the set with the indices Kn — 1}, {Kn, ■ ■ ■, 2Kn — 1},... are independent), 

which implies that the mixing condition holds. However, unfortunately, Dj, (m„) does not hold. For more details, see Appendix 
K. Therefore, it is impossible to ensure a tight bounds for E \Mn\ using Theorem 2. 
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Nevertheless, similar to the lower bound as presented in Section VI-A, denote to be the maximum of n random variables 
that follow a NB distribution with h successes, where the first ft, — 1 channels are in common for all users, (denoted as W), 
whereas the last channel is independent for each one of the users, denoted as (Yi,..., y„), and Y = max (Yi,..., Y„), where 
Yi are i.i.d. geometric random variables. As a result, we have 

E [Ml] = E [lY] + E [Y] (127) 

ft — 1 7 

> -f log 1 n - - - 

I — P P log p 

= log 1 n + o (log n), 

P 

where E [W] is the expected valued for a NB with ft — 1 successes, and E [Y] was already found in eq. (69). 

In conclusion, for constant ft, although it is not possible to obtain the exact expression, we may still find very tight upper and 
lower bounds, such that both scale as logi n, namely both have the same constant multiplicative factor. Hence, the scaling 

P 

law is known exactly. Note that, unlike previous cases, when h is constant, the correlation between any two sets of random 
variables, is not increasing since the number of the correlated channels (i.e., channels that are mutual for both sets) is not 
increased, which causes the scaling law to behave as the i.i.d. case. 

2) ft„ is an increasing sequence: Similar to the previous subsection, the lower bound is also valid in this case. For the 
upper bound, note that by Lemma 3, and specifically by eq. (62), the result of the previous subsection can be applied also 
for ft„, as long as it satisfies 

^ 0 . ( 128 ) 

log n 

as n —> oo. When eq. (128) is not satisfied, we found that it is possible to apply Theorem 4 on the distribution of M„, for 


where 


and Qfp K is the solution for 


b„ = ap^K„ log n + log^ s/\ogn\ogK„ + /3, 


/3 = logi 


{i-pY 


Therefore, 


E [Mn] < E 


— bn — - 7 + 1 

logt) 

= log n + o (log n). 


For further details see Appendix L. 


IX. Simulation Results 

In this section, we present some simulation results in order to examine important aspects of the analytical result given 
thus far, such as convergence rates and the tightness of the bounds. The simulation clearly depict the tightness of the bounds 
given, and their ability to closely approximate the correct completion time under various system parameters. In some cases, 
however it is clear that the number of nodes should be large before the asymptotic kicks in. 

The first simulation was done for the main result, that is, the expected value of the completion time, E [M„]. In particular, 
we considered a multicast network, with a structure of a full binary tree T„, with n = 2^, N = 4,... ,23, and for 
p = 0.1, 0.2,0.5, and we checked the expected completion time to disseminate a single packet over T„. We evaluated the 
suggested completion time through extensive MATLAB simulations, and the results are given in Fig. 9. For each N and p, 
we averaged over 10"^ realizations, where on each observed value, we found the maximum of 2^ random variables, each of 
which is the sum of N geometric random variables with probability of failure p, where each geometric random variables 
represents the numbers of trials needed to forward the packet through a certain channel. We compared the result to the upper 
bound, i.e. E , given in eq. (102), and the analytic result of the three lower bounds, as proposed earlier in eqs. (46), (71) 
and (82). Recall that for the lower bound proposed in eq. (82), we have a degree of freedom, by the choice of fc„, where large 
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(c) p = 0.5 

Fig. 9; Simulations for E[M„], p = 0.1,0.2,0.5 (averaging 10^ realizations), together with the upper bound (ap = 
1.78807,2.25379,4.4035 respectively), as given in eq. (102), and three different lower bounds: the simple one written in 

as in 
.,23. 




corresponding to eq. (71); Finally, E [M^] 


the graph as E [T] and outlined in eq. (46); The second one, E 
eq. (82), where for p = 1, (Fig. 9a), we have chosen log 2 A:„ = 4 
For p = 0.2 (Fig. 9b), log 2 A:„ = 4 for all N. Finally, for p = 0.5 (Fig. 9c), log 2 fc„ = 4 for = 3,..., 10 and log 2 kn = 7 
for Af = 11,...,23. 


or )V = 3,..., 10 and log 2 = 7 for A^ = 11, 


kn results in a tighter bound, but the convergence may be slower. Hence, in the simulations below, was chosen for the 
tightest bound possible. However, to show the dependence on kn, we conclude this section with simulations that specifically 
address this issue, and give bounds of different values of A:„. 

In addition, we also plot in Fig. 10 a simulation for N = 5, 10,15 as a function of p, for 0 < p < 0.2. As shown in these 
graphs, the bounds hold for each N, including relatively small N. Moreover, one can see that the gap between the simulations 
results of E [M„] and the upper bound which is based on the i.i.d. scaling law, is increases as N increase. This implies that 
it is indeed not possible to apply Theorem 2 for the problem at hand, as Theorem 2 would imply that the extreme value 
distribution of the correlated sequence should converge to the same limiting distribution as if the sequence were i.i.d. 
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p 

Fig. 10: Simulations for E [M„], for N = 5, 10,15 as a function of p (averaging 5000 realizations), together with the upper 
lower bounds for log 2 = 4, given in eqs. (82) and (102), respectively. The bounds for any N are the corresponding dashed 
lines. 


Further result that was examined is the gap between the completion time T, and its upper bound, as a function of the 
probability that a message is successively received by all nodes in T„, as we found in eq. (107). The result is given in 
Fig. 11, on a log scale. Note that the linearity of the graph results from the fact that log^ (— log (1 — e)) = log ,5 e + O (e). 
The construction of the simulation is made by calculating the empirical distribution function of M„, from the 10® generated 
random samples data, for n = 2^, N = 5, 10,15 and for p = 0.1, 0.2,0.5. In general, we can conclude from the graph that 
the simulations and the upper bound are roughly close to each other, for each N and p. Furthermore, we can see that for 
large N, i.e., iV = 10 and N = 15, the gap remains stable, and in particular does not increase, even for very small e, while 
for relatively small N, i.e., N = 5 there in a moderate increase in the gap, as e decreases 

An additional conclusion, is that the slop of the upper bound (and hence also for the simulation result) is only negligibly 
effect by N, yet is obviously affected by p. This can easily be seen from eq. (107), i.e., the expression log^ (— log (1 — e)) 
is a function of e and p, but not of N. 


The next simulation is made for E[y], that is, a part of the lower bound as outlined in Section VI-A. Namely, Y is 
the maximum of n random variables which are parted equally into ^ subtrees with height of log 2 kn- We checked the 


convergence of it to the i.i.d. case (denoted as E 


Y 


), which simply means that Y = max 




, where the sequence 


is i.i.d. and follows the same marginal distribution as (Yi,..., Y„). We also plot the analytical upper and lower bounds (which 
differ only by one) as in eq. (80), as a function of number of users 2^, for W = 1,..., 24, and for p = 0.05,0.1, 0.5. Results 
are plotted in Figs. 12a to 12c, respectively. For each p, the figure depict the cases of log 2 kn = 4, 7,10. 

The first conclusion is the convergence of E [Y] to E Y depends on kn- That is, as kn is greater, so the convergence of 

E [Y] to E Y is slower, and it can be clearly seen by the proof of Theorem 2. Specifically, the difference between the 

distributions of Y and Y is reflected in eq. (51), where kn is a dominant factor for the difference between them. Note that 
Theorem 2 is on the distribution, not the expected valued, yet, the relationship between them is clear. Another conclusion 


on the relation between E [Y] and E 


Y 


is that the impact of p is relatively negligible for the values that were simulated. 


which is contrary to the proof of Lemma 5, where it can be seen that p may affected the convergence. Yet, clearly p has no 
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P=0.1 p=0.2 




p=0.5 



Fig. 11: Simulations for the number of time slots T required until all the nodes received one packet in T 2 iv with probalility 
1 — e, as a function of e, for p = 0.1,0.2, 0.5, and for N = 5, 10,15, with the upper bound given in eq. (107). 


impact on the local dependence of (Yi,..., Y„). Next, we examine the convergence to the analytical upper and lower bounds 


(which are marked as black dashed lines), for both E [Y] and E 


Y 


. Recall that the rate of the convergence is determined 


according to the choice of and it can be seen in the transition from eq. (42) to eq. (44). Hence, both p and have an 


impact on the converge in the analytical bounds, for both E 


Y 


and E 


Y 


. It is clear that for small p, i.e., p = 0.05,0.1 


both values converge between the upper and lower bounds in the range of N that was tested. However, for p = 0.5, we can 
see a relatively large error. Yet, we should note that first, the packet loss probability p is usually small, hence we should put 
more attention to the cases of small p, rather than large p. Furthermore, it is clear that the distance to the analytical bounds 
does get smaller as N increases, hence we expect it to converge for very large N. Moreover, we propose that for p = 0.5, 
to take bn such that bn may replace bn defined in eq. (65), 


bn = log 2 n + (log 2 kn - 1) log 2 (log 2 kn)j - log 2 ((log 2 kn - 1)!) 


(133) 
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(a) p = 0.05 


(b) p = 0.1 




(c) p = 0.5 


(d) p = 0.5 


Fig. 12; Simulation of E [F], compare with the i.i.d. case E 


Y 


, (averaging 500 realizations), together with their upper and 


lower bounds (black dashed lines) for log 2 kn = 4, 7,10 and p = 0.1, 0.2,0.5, given in eq. (80), where in Figs. 12a to 12c 
we use bn suggested in eq. (65), and in Fig. 12c, bn suggested in eq. (133). 


4' {x) = log 2 n + {x — 1) loglogn — log 2 (x — 1)!. 


In order to show that proposed above is an appropriate sequence, one needs to prove that the conditions for Theorem 2 
hold and in particular eq. (32) and condition D^. (m„). We skip these proofs. However, note that differs from by only 
4' (compare to T' defined in eq. (55)), hence, one can see that 0. In Fig. 12d, we plot simulations of E [F], and 


the i.i.d. case E 


F 


, same as Fig. 12c, however, the upper and lower use instead of We can see that the convergence 
of both log 2kn = 4, and log = 7 is significantly better. 

Finally, in order to explore the convergence of the expected value of (i.e., the upper bound for E [M„] as defined in 
Section VI-B), we simulated, by choosing the maximum from n i.i.d. random variables, following NB distribution with log 2 n 
successes, and averaging over 5000 realizations. The results are plotted in Fig. 13, for p = 0.1,0.5, 0, 7. For each value of p, 
the figure depicts the simulation result together with the analytical upper and lower bounds, given in eq. (101), as a function 
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Fig. 13; Simulation of the expected value of maximum of 2^ i.i.d. NB with N successes and p = 0.1,0.5, 0.7 (averaging 
over 5000 realizations), together with their bounds (black dashed lines) given in eq. (101) 


of N. Note that the upper and 
even for small N, namely E 


lower bounds differ only by 1, for any valued of N. We can see that Equation (101) holds for 


M2N 


is confined between E [T;], and E [T„] for different values of p and even for small N. 


X. Conclusion 

f In this work, we presented an asymptotic analysis of the completion time when disseminating M messages through a 
multicast network, using EVT as a primary tool. 

We characterized the limit distribution of the normalized maxima of NSIV sequences, under appropriate dependence 
restrictions, and specifically, we found that under appropriate conditions, the asymptotic distribution behaves as the same as 
if the sequence were i.i.d. Possible extension is to study NSIV sequences who have a non negligible local dependence, and 
non-identical marginals. 

The result of EVT enabled us to bound the scaling law of the expected completion time. In particular, we found that 
these bounds are depend on p, where for small p, the bounds are tight. In addition, for large M, we found that the expected 
completion time scale as " . Tight bounds were also found for the case where the hight of the multicast tree is small 

enough. In addition, we also found an upper bound for the exact distribution of the completion time. Possible extension is to 
assume heterogeneous packet losses. Note that the distribution of is no longer NB, thus the main challenge will be to find 
an appropriate for the this distribution. Another possible extension is to assume some correlation between the channels, 
where in this case, the main challenge is to show the dependence conditions hold. 


Appendix A 
Proof of Lemma 1 

Similar to the proof of [19, Lemma 2.1], divide each sets /i,..., into subsets Ii i, Ji 2 , T 24 , / 2 . 2 ) ■ • ■) ^ 2 ,rn where 

Il,l = {1, 2, . . . , A:„ — („} Ii^2 = {kn ~ + 1) • • ■ ) kn} (134) 

-^ 2,1 ~ \_kji -f 1, . . . , 2kji ^2^2 ~ { 2 ^n “t” 1,..., 2 A;^} 


^r-n,l — {(t*n l)^n “f 1, . - . , ^nkn 

kvn ,2 — \^nkn 1, . . . , It} 
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Hence, each pair of intervals 1;^ i are separated by at least In, where In be as in the definition of D{un)- Obviously 
we have 


Pr 


n^(4i) 


Pr {Mn < Un) < EE Pr {X, > Un) 
1=1 iei,^2 


(a) 

< {rJn + kn) Pr {Xi > Un) 




“t“ kn) 



\ n 



(135) 


as n —oo, where (a), true since Xi have identical distribution for all i, and each 2 contain In integers, except Ir „,2 
having n — r„fc„ + + In, (recall that r„ = [n/fc„]), and (&) we use eq. (28), then there exists constant K such that 

Pr {Xi > Un) < Next, by [50, Lemma 2.1] it follows that 


Pr -J]Pr(M(4i)) 


1=1 


< TnUr. 


Finally, denote Ii = {// 1 U Ii, 2 }, and M^(*) be the complementary event of M, then 


(a 


YlMiI^^,)-YlMiI^) < ^(Pr(M(4i))-Pr(M(J0)) 




1^1 


Z=1 


(b) 




1^1 


<£pr(M^ ( 42 )) 


1=1 


(136) 


(137) 


= Epm U 

1 = 1 

/ \ 'l^n — l 
(c) 


< In'Pr {Xi > Un) + kn Pr {Xi > Un) 


Z=1 


< K Tn— + — ^ 0, 

n n 


where (a) using | Jli 3^* “ Hi 2/il < \ Y.i^i- D'i.V for a;*, Vi € (0,1], (5) holds since Pr (A) - Pr (A n B) = Pr [A n B^), 
and in (c) we apply union bound. The result now follows at once by combining eqs. (135) to (137) 


Appendix B 
Proof of Claim 1 


Let a„ be a sequence such that —>■ a < 00 , as n —)■ 00 , then 


lim 

m—>-oo 


(1 + — 

V TO / 


= e 


(138) 


Proof: We prove it by the definition of the limit of sequence. Let e > 0, and e = log (|e “ + l), and note that e > 0. 
Since > a, there exists TOi, where for each to > to-i satisfy jam — a| < e , thus 

' e 


Urn < a + 


log ( 


2e-“ + l). 


Hence, for each to > TOi, we have 


(i I Q + log(ie ° + l) Y 
V m / I m I 


(139) 


(140) 


let e = f, then by the definition of the limit of the exponential function, there exist m 2 , such that for each to > m 2 , we 
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have 


/ / _ \ \ ^ 
l+ ^ + log(i"-° + l) ] <e“+>°g(i^-“+i)+e" 


=e + 2 +^ 
= e“ + e. 


Now set Too = max ( 7711 , 7712 ). similarly, on the other side, for e = — log (l — |e “), one can verify that there exist toq, 
such that for each m > mo. 


which complete the proof. 


(1+ — 

V m / 


> e - e. 


(141) 


Appendix C 
Proof of Lemma 3 


By eq. (22) we have 

B{p; (j){n) - m„ + 1, m„) = 


P 


+ _ pY 


(j){n) - m„ + 1 


F((/)(r7) + 1, l;^(r7) - m„ + 2;p), (142) 


where, by eq. (23), it can be verified that 

F{4>{n) + 1,1; (j>{n) - m„ + 2; p) 


= 1 + 


(j){n) + 1 


{(pjn) + 1) {(j) (n) + 2) 

(j) (n) - rrijn + 2^ ' (n) - m„ + 2) {(j) (n) - m„ + 3) 

00 k 


-.p + 


P^ + ' 


=i+En 


fc=ij=i 


(j)in) + j 


(l){n) -run + l+j 


■ P 


and 


n ( _) = n (1 + 

k 

sn(i+ 

i=i 


m„ - 1 


(j){n) - m„ + 1 + j 
m„ - 1 


4>{n) — mr. 


= 1 + 


Denote 


g{n) =p(l + 


rUn- 1 
4>(jl) — rUr, 

<f>(n)-m )’ P n ^ 00 . Then 


rrin — l 


00 / .| \ 

F{(j){n) + 1,1; (j){n) - m„ + 2;p) < 1 + E ( ^ + ,E_ -) 


00 

= E^w^ 

k=0 

(a) 1 


1 - g(n) 


1 m. 


1 -p ' yiognj ’ 
where (o) hold for large enough n, such that g(n) < 1. On the other hand, 

00 fc / ir \ . 


F{4>{n) + 1,1; (j>{n) - m„ + 2;p) = 1 + ^ . P 

^ \nn) -mn + l+jJ 


(143) 


(144) 


(145) 


(146) 
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Hence, by eq. (142), for large enough n, we have 


oo k / ^ 

=i+vnfi+ ^,,. 

\ -TTln + l+J 

OO 

> l + £/ 

k=l 

1 

l-p' 


P 


k 


B{p; (j){n) - TO„ + 1, m„) = 


P 




4>{n) - m„ + 1 


1 + 0 


/ nir, 


Vlogn 


In addition. 


, 1 ^ t (O r((/)(n)-m„ + l)r(m„) 

B\(p[ri) mn + k,Tnn) — 

r (0 (n) + 1) 

(|) (m„ - 1)! 

" nr=v'(<^H-j)’ 

where (o) is due to eq. (19), and (6) results from eq. (20) and the fact that to„ € N. Hence, we have 




B{(j){n) - TO„ + 1,TO„) 

I m 


(6) nr=0 ' ('^H - j) „+l(l _ 


( to „ - 1 )! 

(fO _ iij=o _ 

n°‘ {(j){n) - m„ + 1) (m„) 


nr=v'w^)-j) 


(j){n) - m„ + 1 

( 1 + O 


1 + 0 
/ Wn 

\logn 


/ TO„ 


Vlogn 


^ P^ ( \ (rrin) 

n“ ^ ^ 0(n) — m„ + 1 


m-n 

logn 


1 + 0 


\logn 


(d) p-^ 


= W 1 + *^ 

n" ' 


( 147 ) 


(148) 


(149) 


where (a) follow from eq. (52), (6) from eq. (148), and (c) is by assigning 0(n) as defined in eq. (54). Finally, (d) is since 


^Q:(7n-n) 

0(n) —m^i + l 


= 1 + 0 


V log " / ■ 


Appendix D 

The following lemma is a technical claim, which will be required further. 

Lemma 11. Let /[a;|TO,p] be the PDF of NB distribution with parameters m G N and 0 < p < 1, as defined in eq. (14). 
Then 


Proof: Since 


Writing k' = k — M, we get 


As a result, for any M € N 


f[k\rn,p\ (k-j) < 


k—Ti 


i=0 


(2m-1)! 1 

(m — 1)! (1 — pY 


Yf[k\M,p]=Y = 


E 

k'=0 


k' + M-l\ y 


M-l 


/ (1-p)^^ =1. 


OO M— 1 


Yp^ n (*+7)= 


fc=0 j—1 


(M-l)! 


(150) 


(151) 


(152) 


(153) 
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Consequently, 


^ f[k\m,p] {k-j) = Y^ “(1-p)™ Jl (k-j) 


fc'=0 ^ ^ 7 = 1 


k' + m — C 


oo m—1 m 


' ' k'=0 i=l 

(a) (l-p)"^ (2m-1)! 
(to - 1)! (1 
( 2 to — 1 )! 

^(to- l)!(l-p)”" 


fc'=0 

oo 2m— 1 


where (a) is by using eq. (153) with M = 2m. 


The following lemma is used for the proof of Lemma 4, and in particular, bounds the Probability Generating Function of 
min {Zi, Z 2 ), Zi ~ NB {m,p) i = 1,2, at point i. 

Lemma 12. For two i.i.d. random variables Zi, Z^, with Pr {Z < z) = -F'[z|to,p]. 

00 /i\fc /dX™ 

Pr(min(Zi,Z2) = fc) < 2 I - I . (155) 


Proof: Denote by iXnin[z|TO,p] = 1 — (F’[z|to,p])^, the distribution of the minimum of two discrete in random variables 
with distribution F[z\m,p\. Then 

00 /I \ fe 

^ f -j Pr (min(Zi,Z 2 ) =/c) (156) 

k—m ^ 

/1 \ ^ 

= ^ [-] {F^in[k\m,p]-Fynin[k-l\rn,p]) 

k—m 

= ^f-j (F[/c - 1 |to,p]^ - F’[A:|to,p]^) 
k—m XP/ 

/l\^- 

= ^ ( - j {F[k — 1|to,p] — F'[A:|to,p]) (F[k — 1|to,p] + F[k\m,p]) 

k=m 

°° /1 \ ^ 

< ^ f - j (F[fc|TO,p] — F[k — 1|to,p]) (2_F[A: — 1|to,p]) 
k—m XP/ 

°° / I \ ^ 

X! j /[fc|TO,p]J^[A: - 1 |to,p]. 


Note that 


F[k- l\m,p] = 


f[x\m,p] = 


X — 1 
TO — 1 


p^-”^{i-py 
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l-p\ fx-l 
p ) \m — 1 


oo/i\fc \ m oo 

^ ( - ) Pr (min (^ 1 ,^ 2 ) = fc) <2 ( ^ 


m — 1 






j - 1 

m — 1 




f{k\m,p] 


Y f[k\m,p] 


,UYoik-j) 

ml 


(c) /l—p\™ 1 (2m—1)! 1 

^ \ p ) ml {m—1)1 (1—pY 
^ /n™ (2m-l\ 

=n; U-i 


(d)^ [A 
<2 - 


where (a) is due to changing the order of summation, (b) results from the identity (m-\) “ (m) (c) 

follows from Lemma 11, and finally, (d) uses the bound < 4™, true for each m = 1, 2,... ■ 

Appendix E 
Proof of Lemma 4 

By the construction of (Yi,..., y„), the pair (Yi, Y 2 i) shared log 2 fc„ — z common channels, and i independent channels, 

Z = 1,2, . . . ,log 2 kn - 1. 

Denote the time it takes a packet to cross the common channels by a random variable lY, where lY has CDF 

Y [zc| log 2 — z,p]. Also, denote the time it takes a packet to cross independent channels by the random variables Z\ 

and Z 2 , where and Z-i have CDF F [z|z,p]. Then, we can write the join distribution as 

Pr (Yi > zz„, Y 2 i > Un) = Pr (lY + Zi > m„, 1Y + Z 2 > m„) (160) 

= Pr (lY + min {Zi, Z 2 ) > u„) 

OO 

= Y^ > Un — k) Pr (min (^ 1 ,^ 2 ) = k). 

k=i 

The concept of the proof is to use Lemma 3 for the tail distribution Pr (lY > tt„ — fc). However, k may be too large (since 
k G [i, 00 )), such that zz„ — fc would not satisfy the assumptions of Lemma 3. 

Thus, we will break the segment [z,(X)) into two sub-segment, [z,u(i] and (z;^,oo), such that 

OO 

Pr (lY > Un — k) Pr (min (^ 1 ,^ 2 ) = k) (161) 


< Pr (lY > Un — k) Pr (min (^ 1 ,^ 2 ) = fc) + Pr (min {Zi, Z 2 ) > vY) ■ 

k—i 

Now, chose Vn such that on the one hand, satisfy Pr (min (Zi, Z 2 ) > vY) = but on the other hand, enable to 

Lemma 3 for the tail distibution Pr (lY > zz„ — k), for any k G [z, u^]. Therefore, 


Vn = a log 1 rz + (z — 1) log 1 


T'o, (z) - logi (z - 1)! + 1, 


for some 4 < a < 1. Thus, using Lemma 3, 


Pr(min(Zi,Z2) > vl) = F [z;(,|z,p]^ 

< Pc {vn - MhpY 
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Now, in order to derive asymptotic approximation for the tail distribution Pr {W > Un — k), first note that the boundary level 
Un (which was defined in eq. (65)) is related to NB with log 2 kn successes, while W ~ NB (log 2 — i,p)- Therefore, write 
Un = ui^^ + A(n), where is the boundary level for the random variable W, namely, 

= log 1 n + (log 2 kn-i-l) log 1 — -'S (log 2 kn - t)) - log 1 ((log 2 kn-i - 1)!), (164) 

P p \ p J ” 

and 

A(n) = x + logi {\0g2kn-i)] - Vlog 1 (log2 kn-r)+ log 1 f ) ■ (165) 

P \ p j 7^1 ” P V ^ (log 2 kn) J 

Note that A(n) < AT log 2 log log n, for some constant K, and k < v^, hence A(n) — k > 51ogi n, for some 0 < 5 < 1. 

P 

Then, by Lemma 3, we have 


Pr {W > Un — k) = Pr (IL > + A(n) — k) 

< Fc (ujf + A(n) - k\ log 2 kn - i) 

“ n 

^ nUi(log 2 fc«-f) / T'(log 2 fc„) y°S2fcp-i 

Note that the expression logi is positive for p < ^, and negative for p > 5 . Therefore, 

1 _ 1 

'f (log2 kn - i) log , „ + (log kn-i-l) log l ( 1 ^) 
p p \ P / 

1 

< - 

log! n 

P 

for p < i, and for p > i, it can be verified that 
As a result 

T' (log 2 kn-i) 


(166) 


(167) 


(168) 


where C = max ■ 

Accordantly, assign eq. (168) into eq. (166), would result 

Pr(lL>M„-A:)<^ (^C^log2fc„^ (C^(log2fc„))'°®^'="-'Q^ . (169) 

Now, combining eqs. (163) and (169) into eq. (160), we have 

Pr(ri >u„,r 2 . >u„) (170) 

-~ (c-^\og 2 kn^ (C'T' (log 2 fc„))^°®"''""^^Pr(min(Zi,Z 2 ) = k) (1 + o(l)) 

n \ 1—p J \pJ 

k—i 

( C' 7 -^log 2 ^n ) (C'4' (log 2 fc„))^°®"''""^^Pr(min(Zi,Z 2 ) = /c) ( - ) + ^ (1 + o(l)) 
n \ 1—p J \p) 

k—i 

<2^ (CT- (log2 [c^ log2 + 4^ (1 + o (1)) 

n \ 1 — p / n^“ 

Where the last inequality derived from Lemma 12. 
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Appendix F 
Proof of Lemma 7 


In this proof, we will show that the tail distribution at bm can be represented as the PDF 
hypergeometric function. 


OO 

Pr (X > &„) = ^ Pr {X = k) 


at 6m + 1 multiple some 

(171) 


fc = bm + l 

OO 

= E 

h 

\m — 1 


k-l 
m — \ 


^(1 -P) 


m^k—m 


“1“ ^ H“ 2, 


''(1 - F’ 

= Pr = 5m + 7 ^( 1 , 5m + l;5m - m + 2;, 

where f(^1, 5m + 1; 5m — to + 2; p^, the hypergeometric function is defined by the Gauss series (eq. (23)). We have 
F(l,bm + 1; 5m - TO + 2;p^ 

f5m + l') f5m + 2 ! 


(172) 


, 5m + 1 
= 1 + -P + 




5m — TO + 2 _ J7J _|_ 2^ ^bm — TO + 3^ 

E/nf ~ ] 

k=l j=l \i>m-m+l+ j ) 

OO k 

=i+E7^'n 


=1+ 


OO k 


am + dm+ j 
y a - 1 )to + dm + 1 + j 


1 + ^ TT11 + ^ ^ 

/= 1 V (a - 1)to + dm + 1 + J (a - 1)to + dm + 1 + j 


(♦) 


<i+E/n 1 + 


k=l j = l 

OO 


m 


(a — 1)to + dr. 


= i + ^pMi + 


fc=i 

OO 


m 


(a — 1 )to + dr 

am + dm 


^■^E [P(^a-l)m + d 

y 1 

1 _ am+d^ 

P (a— l)m-\-djn 

1 


i-P(7^ + 0(tr) 


1 - 


7=TP 


+ 0 



'd^ 

m 


where (*) holds for large enough m. On the other hand, define an increasing sequence such that Im = o(m). Then 


, 5m + 1; 5m - TO + 2;p^ 


( 173 ) 


=i+E/n 


k^l j 
Im k 


am + dm + j 


i'+Ef*!! 


\ ia- 1)to + dm + 1 + j 

OTO + dm + j 


^ 7=1 V (a “ + dm + 1 + j 
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=i+E^^'n 1 


k=l j=l 
Im k 


m — 1 




k^l i=l 
Im 


= i+E/ 1 


(a — l)m + dm + 1+ j 
m — 1 

(ci — 1)t7T. + dm + 1 + ^r; 


TO — 1 


(tt — 1)to + dm + 1 + 

OTO + dm + Im 


k=l 

/ .1.1 \ k 


= E 

k=0 

set Im = —dm (recall that dm < 0), then 


(fl — 1 )to + dm + + 1 


k=0 


= E -+ o - 

“ V a — 1 V TO 


1 




-dm + 1 


1 




o(- 1+0 

TO 


a — 1 


-d„ 


Appendix G 
Proof of Lemma 8 


Remember that by eq. (88), 


F, 




= c„ 


r(&m + 1) 


r(TO)r(5m - TO + 2) 






Using Stirling’s approximation [69, pp.257]. 


it can be verified that 


r(&m + 1) 


rM = Mr‘Y(i + o 

\ X \e/ \ 


&m+l 


r(TO)r(5m -TO + 2) /mt™ 


277 


1 + 0 


6^ — 771+2 


{bm + 1) 


brr. + l 


mx hm — TO + 2 


TO -( 5 m - TO + 2)&™—+2 


1 + 0 


Assign &m = arn + dm, we have. 


(^m + 1) 


h™ + l 


TO™(6m - TO + 2)'>™-™+2 

^ (am + dm + 1)°"^+"- + ^ 

((a - l)m + dm + 

_( \ (1 + +) 


{am + dm + 1) 


dm + l 


Note that 


(a-l)“ (i I _+a±l_V“ ((a - 1 )to + dm + 2) 

“T {a-l)m J 


1 H —— = exp am log 1 H —— 


dm+2 ■ 


am 


am 


= exp am -FF + O 


am 


m^ 


(174) 


(175) 


(176) 


(177) 


( 178 ) 
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= exp [dm + 0 [ — 
m 


o ^ 


= e^™e 


= e'*™ ( 1 + O ( ^ 
m 


Similarly, 


In addition. 


{am + dm + 1)“ 


1 + 


^+l 


dm + 1 
(a — l)rTi 


(a— l)m 




1 + 0 ( ^ 


{{a — l)m + dm + \a—1 


+ 0 


dm 

m 


dm + l 


(o — l)m + dm + 2 


a — 1 


dm + l 


1 


(a — 1 )to + dm + 2 


l + 0( — 
m 


Using the same technique as in eq. (178), it can be verified that 

„+i , 

= e 


1 + 0 ( — 
m 


'd^ 

= ( 1 + 0 ( ^ 


Substituting eqs. (178) to (181) into eq. (177), we have 

(hm + 1)^™ + ' 

e-^- 



dm + 1 


1 


a — 1J (a — l)m + dm + 2 


1 + 0 


m 


a — 1 


dm + l 


1 


(a-l)r 


1 + 0 


^m 

m 


In addition. 



dm + l 


1 + 0 ( ^ 
m 


and therefore 


F, 


(&m) 


= Cm - ^ -(1 


T{m)T{bm — m + 2 ) 


= o„ 


a 


\a—l 


{a-l) 

(l-p)V™+i-™ ( 1 + 0 
= C„ 


d„ 


1 


1 


a 


a-lj (a - l)A/m V « “ 1 
dl 


m 


a“ 1 , \ 1 1 


(a - 1) 




TO (a — 1) V 27r 


(179) 

(180) 

(181) 

(182) 

(183) 

, (184) 

(185) 
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Appendix H 

Proof of Proposition 2 

First note that gp (a) is a continuous function of a satisfying limQ,_>.oo 9 p{ci) = —1. In addition, note that 

^9p (a) = (ffp(a) + 1) log ■ (186) 

For a = we have log = 0, and one might check that this point is indeed a maximum value, with gp = 1- 

Furthermore, for each a > amax, log < 0^ {gp{a) + 1) > 0, hence, we can conclude that gpia) has exactly one 

stationary point (maximum), at ctmax = pz^- 

Now, by the Intermediate Value Theorem, there exist exactly one Up € (xmaxjoo), such that g {ap,p) = 0. 

Remark 4. Note that, lim„_j,]^+ gp{a) = 1 — 2p, which implies that for 1/2 < p < 1, there is another root to g, at some 
point within the interval fl, p—l- 


Appendix I 
Proof of Lemma 10 


Similar to the proof in Proposition 2, we show by the Intermediate Value Theorem, that gp{a\M) > 0, for a = while, 
for a = + Cm, we have gp{a\M) < 0. Hence, the Intermediate Value Theorem guarantees that the root ap^M exist at 

some point within this interval. 

On the one hand, it can be verified that for a = we obtain pp(a|M) = 2 m — 1 > 0, for each M = 1,2,... On the 
other hand, using Taylor approximation for gp{a\M), at a = we have 


gp{a\M) = 2 m -l - 


+0 a — 


1-p 


Assign a = for s > 0, we have 

gp{a\M) =2^-1- 


2p 


-2 m 


(») log2 (1-p)^ 1 


M 


2p M2« 


1 

+ o( 


M2s 

,M3« J 

+ 0 

1 M2 

)+0 


1 


M1+2S / ’ 


(187) 

(188) 


(189) 


where in (*), we use again Taylor approximation, as M —> oo. Now, clearly see that if s < i, we have 

(l-p)2 1 


gpia\M) = -- 

for sufficiently large M, which complete the proof. 


2p M2s 


O 


< 0 , 


(190) 


Appendix J 

Proof of asymptotic distribution for normalized Y, for general AT-ary tree 

In this chaper, we show the existence of asymptotic distribution for normalized Y, i.e. Y — which base on the structure 
of Section VI-A. First, similar to Proposition 1, it can be easily seen, by Lemma 3 that for the suggested in eq. (123) 
satisfy 

lim n.Fc(M„|log^fc„,p) (191) 

n—foo 

for Un = bn + X, and to„ = 

Next, recall that the correlation between two random variable, is reflected by the number of shared channels, thus we can 
restate Lemma 4 for K child as: for i = 1,2,..., log;^ — 1, the joint distribution 

Pr(Yi > > Un) < - (CT'(log;^ (^C-^log^/cn^ + ^- (192) 

n \ I — p J 
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For Lemma 5 we show the complete proof: 

Lemma 13. The sequence (Yi,..., Y„) satisfy condition Df. (m„) 


Proof: Let 


k„/K 

= n ^ Pr {Xi > Un, Xi > Un) . 


( 193 ) 


Then by eq. (51), condition Dj, (un) satisfy if an —> 0, ai n —)■ oo. 

As mentioned before we can write the joint distribution Pr (Yi, Yi) as Pr (W + Zi,W + Zf), where W is random variable, 
represent the shared branches of the tree (dependent channels), ofYi,Yi and Zi, Zi as the independent branches (channels) 
ofYi,Yi respectively. 

For the general case, one might check that Yi have (K — \)K''~^ random variables who shared \ogj^ kn — i channels Then 
we can write q;„ as 


k„IK 

an =n ^ Pr (Yi > Y* > Un) 

i^2 

logit k-n-l 

=n YI {K - 1)K^-^ Pr (Yi > Un,YK^ > Un) 

i=l 

log^fc„-l / /A 

<n Y { - iC'P (log^ kn))'°’^’^ ( C- -log;^ kr 

U V 1-F 


(194) 


=2(C'4' (log^ kn)) 


logjf k„-l 


logjf fcn-l 




1 1 


2n2 


— ] 

nfaj 

logjf fe„-l 

3T Y 

i=l 


where in (a), we used eq. (192). Now, let g{n) = ^^d note that g{n) 0 as n ^ oo, then for large enough 

n, we have 


logit feri-i 

a„ <(C'4'(log^A:„))'°®^''""^ Y (din))" 


1 kn-K 

la -1 K -1 


= {C^{\0gj,kn)) 
<(C'4' (log^ kn)) 


logjf /c„-l g(^) I_ ^ 

l-g(n)^ ^ 

logjf fc„-l 5(tt) kn „ 


2a-l 


1 — g{n) 

as n ^ oo. since similar to Lemma 5, (C'A' (log^^ ^ 1, i^^[n) choose of kn, we obtain 


_f A 

„2a-i u 

The lemma was proved. 

Hence by eq. (191), and Lemma 13, it is possible to apply Theorem 2, i.e., for sufficiently large n. 


e-P"” < Pr(Y< w„) < e-P", 


(195) 


Appendix K 


Let (Ti,... ,T„) be the completion time for the n leaves, on multicast tree T„ with height h, h = 1,2,.... Hence, each 
node have exactly Ifn child nodes (i.e. Kn). Thus, T„ consider as Kn dependent, namely the sequence Ti, ... ,r„ can be 
parted into Kn sets, where each one of them contain niKn elements. Therefore condition D). (m„), for kn = ff- can be 
written as 


where 


k„ 

Dk^ (un) : limsupn^ Pr (Xi > u„, Xi > m„) = 0, 
1=2 


\ ^ ^ 

^ E ^ ^ ^ ” o” ) E Pr (Pl > lln, > Un) . 

i^2 ^ ^ 


(196) 


(197) 
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Clearly, the sum go to zero iff each element of the sum go to zero, and in particular for i = h — 1. Namely, 

where the pair (Ti,T^~^) shared one channel in common and h — 1 independent channels for each one, and therefore the 
number of pairs is Write the mutual channels as W where W follows geometric random variable, and be 

the independent channels of , respectively. As a result. We have 


( 1 ”) 5 S P’' (^1 >^n,T^-J> U^) 


for each h = 1, 2,.... 


> 


2 

K‘^ 

Kn-l 

2 

n 

2 

K‘^ 

Kn-l 

2 

n 

2 

K‘^ 

Kn-l 

2 

n 

2 

K‘^ 

nh - 1 

n^- 

2 

fl 

ni - 1 

v}~ 


(199) 


Pr {W + Zi> Un, fP + ^2 > Un) 
Pr {W>Un) 

p“" 


jh-l] 


h -1 


h -1 


1 — — 
n h 


(logn) 


h -1 


Let 


where 




M„ = max(Ti, 

is an i.i.d. sequence follow NB distribution with successes, where 


hn — 


logn 

logATn 


/i„ = 1, 2,..., and let u„ = &„ + x, where 

bn = cXv.K.. log n + log^ i/lognlogATn + /3 




( 200 ) 

( 201 ) 

( 202 ) 

(203) 


where ap^K^,b,P defined in Section VIII-B2, then 

lim sup Pr I M„ < Un) < e 

n—>-oo ^ ^ 

lim inf Pr (Mn < n„) > e~^ 

Proof: The constellation of the proofs will be similar to Section VI-B, hence, we write only sketch of the proof and 
necessary differences will be mention. 

First let present 6„ as 6„ = a logn + c„, where c„ satisfy —> 0, 

Pr (T, > 6„) = Cn Pr (2f = 5„ + 1) (204) 

where C„ = + o(l)> b = ^ ° i p. The prove is based on Lemma 7, and in particular we show that hypergeometric 

function F"(l, &„ + 1; 6„ — ft,„ + 2;p) is convergence to as n —)■ oo. 

OO ^ / 7, _l_ ■ 

^"(Ij + 1; (>n ~ + 2;p) = 1 + 


k=l j-1 


bn ~ hn + i + j 


(205) 
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oo 


=i+Ep'n 

fc=i j=i 


am + Cn + j 


Li V (a - ■^)n + c„ + 1 + j 


Hence the only change from the proof in Lemma 7 is that we write a — instead of a — 1, therefore we can define Fc- 


Fc ( bn ) =Cn 


r( 6 „ + 1) 


‘r(/i„)r(&„-/i„ + 2) 

Next, we show that Fc (bn) = ^ (1 + o (1)). 

Using Stirling’s approximation, it can be verified that 


(l-p)V 


hnrr^^rL — hn-\-l 


r(6n + 1) 


fb„ + l 


,K + 1 


2Tr 

bn + 1 


T(hn)T{bn -hn+2) l^ b„-hn+ 2 -^f>n-h „+2 ^ 


l + 0( — 

hn. 


= e 


hn-hn+2 

(bn + 1)^"“*"^ ^/hn^bn — + 2 

(bn -hn + 2)''"-^"+2 


i + o[ — 

hn. 


Assign bn = alogn + c„, (where Cn = O (log log n)) we obtain 

(bn + 1)'’" + ! 


h’i-(bn-hn+2)br.-hn+2 


((“- 


logK„ 


(alogn + c„ + 1) 


log n + c„ + 2 


a log n+Cji + 1 


(°- logg„ ) log»+c ^+2 


log n 


= e log Kn 


' alogn y 


a log n 


(“ log) 


log K-n 


1 + 


e „+2 


(“-TSife) '°g' 




(alogn + c„ + 1) 


c„ + l 


((‘ 


c„+2 • 


logn + c„ + 2j 

Based on the technique in Appendix G, eq. (178), it can be verified that 

a log n 


fl + 

' a log n y 


1 + 


c„+2 


(“-I55Tfir)'°g' 


(“-I5rW)l°S" 


= 1 + 0 


Vlogn 


and 


(alogn + c„ + 1) 


c„ + l 


c„ + l 


((a _ logn + c„ + 2 )'"^' V« - rai / (« - m) log^ + + 2 


1 + 0 


Vlogn 


In addition 


Hence by eqs. (207) to (211) 


bn — hn + 2 / O' 


&„ + 1 

r(^n + 1) 

T(hn)r(bh-h + 2) 

= I log7f„^- 


lo^ (1 + 0 

a 


logn 


log n 


Cn + 1 


(“ log) 


log K-n 


a — 


log if „ 


(206) 


(207) 


(208) 


(209) 


( 210 ) 


( 211 ) 


( 212 ) 
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I a — 


log Kn 






And therefore 


r(&n + 1) 


= C I logX™ 


log n 


Crr + 1 


(“ logV„ ) 


log K^i 


a — 


log-ff„ 


1 


(«- I5i^)log^ + Cn + 2 

(1 + 0 ( 1 )) 




log n 


= C logX^ 


(“ logV„ ) 


— - p°' logKn (l_p)log*:„ 


a/ 


(®“ I5i^)log^ + On + 2 V 27r 

log n 

writing n,„ = -, and by eq. (131) 

log Ar„ 


I? (1 + 0 ( 1 )) 



<5^" (1 + 0(1)) 


logn + o (logn) (1 + 0 (1)) 


= -( 1 + 0 ( 1 )) 

n 


and therefore for m„ = 5„ + x, for 6„ 


limsupPr (m„ < Un) < e ^ 

n—¥oo ^ ^ 

liminf Pr (Mn < Un) > e~^ 

n—^oo \ / 


which imply E 




^ bn - + 1 


( 213 ) 


(214) 
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